Application No.: 10/091,007 
Amdt. Dated: Feb. 2, 2004

Reply to Office Action of Jan. 2, 2004 



Amendments to the Drawings: 

The attached sheet of drawings includes changes to Figs. 1, 3, and 4. This sheet, which includes 
Figs. 1-16, replaces the original sheet including Figs. 1-16. In Figs. 1, 3, and 4, previously 
omitted SEQ ID NO:s have been added. 



Attachment: Replacement Sheet 

Annotated Sheet Showing Changes 



REMARKS/ARGUMENTS 



In the specification, the paragraph on page 27, beginning with line 20, has been amended 
to add the previously omitted SEQ ID NOs to each nucleotide sequence that is more than ten 
nucleotides in length. Appendices I and II have been replaced with new Appendices I and II,
which merely add the previously omitted SEQ ID NOs to each nucleotide sequence that is more 
than ten nucleotides in length. 

In amended Figure 1, the previously omitted SEQ ID NOs have been added to each 
nucleotide sequence more than ten nucleotides in length and to each amino acid sequence more
than four amino acids in length. Similarly, in amended Figures 3 and 4, the previously omitted 
SEQ ID NOs have been added to each nucleotide sequence more than ten nucleotides in length. 
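
The length thresholds stated in the remarks above (nucleotide sequences of more than ten nucleotides, amino acid sequences of more than four amino acids) can be expressed as a simple check. The following is only an illustrative sketch: the function name `needs_seq_id` and its `kind` argument are hypothetical and are not part of the filing.

```python
def needs_seq_id(sequence: str, kind: str) -> bool:
    """Return True if a sequence exceeds the length threshold in the remarks.

    Thresholds assumed from the text above: more than ten nucleotides,
    or more than four amino acids. All names here are illustrative.
    """
    # Count letters only, so spaces or line breaks inside a figure are ignored.
    length = sum(1 for ch in sequence if ch.isalpha())
    if kind == "nucleotide":
        return length > 10
    if kind == "amino_acid":
        return length > 4
    raise ValueError(f"unknown sequence kind: {kind!r}")


if __name__ == "__main__":
    print(needs_seq_id("ATGAAAGACTT", "nucleotide"))  # 11 nucleotides
    print(needs_seq_id("MKDL", "amino_acid"))         # 4 amino acids
```

A sequence of exactly ten nucleotides or exactly four amino acids falls below the threshold in this sketch, because the remarks use the wording "more than".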

Applicants respectfully request entry of the above amendments. In view of the above
remarks, early notification of a favorable consideration and allowance of all claims is 
respectfully requested. 

If, after this amendment, any issues remain that discussion could resolve to advance
prosecution, Applicants respectfully request that the Examiner call the undersigned attorney at
the telephone number listed.



HUNTON & WILLIAMS 
1900 K Street, N.W. 
Suite 1200 

Washington, D.C. 20006-1109
Telephone: (202) 955-1500 
Facsimile: (202) 778-2201 

Attachments 



Dated: February 2, 2004 



By: 



Laurence H. Posorske
Registration No. 34,698 
David A. Kelly 
Registration No. 53,106 




62785.000002 WASHINGTON 397089v1
Annotated Sheet Showing Changes



- FIGURE 1 - 
ID-65 

Clone 3-60 

GTGTTTATGATGAAAAAAGGACAAGTAAATGATACTAAGCAA 
TCTTACTCTCTACGTAAATATAAATTTGGTTTAGCATCAGTAA 
TTTTAGGGTCATTCATAATGGTCACAAGTCCTGTTTTTGCGGA 
TCAAACTACATCGGTTCAAGTTAATAATCAGACAGGCACTAG 
TGTGGATGCTAATAATTCTTCCAATGAGACAAGTGCGTCAAGT 
GTGATTACTTCCAATAATGATAGTGTTCAAGCGTCTGATAAAG 
TTGTAAATAGTCAAAATACGGCAACAAAGGACATTACTACTC 
CTTTAGTAGAGACAAAGCCAATGGTGGAAAAAACATTACCTG 
AACAAGGGAATTATGTTTATAGCAAAGAAACCGAGGTGAAAA 
ATACACCTTCAAAATCAGCCCCAGTAGCTTTCTATGCAAAGA 
AAGGTGATAAAGTTTTCTATGACCAAGTATTTAATAAAGATA 
ATGTGAAATGGATTTCATATAAGTCTTTTGGTGGCGTACGTCG 
ATACGCAGCTATTGAGTCACTAGATCCATCAGGAGGTTCAGA 
GACTAAAGCACCTACTCCTGTAACAAATTCAGGAAGCAATAA 
TCAAGAGAAAATAGCAACGCAAGGAAATTATACATTTTCACA 
TAAAGTAGAAGTAAAAAATGAAGCTAAGGTAGCGAGTCCAA 
CTCAATTTACATTGGACAAAGGAGACAGAATTTTTTACGACC 
AAATACTAACTATTGAAGGAAATCAGTGGTTATCTTATAAATC 
ATTCAATGGTGTTCGTCGTTTTGTTTTGCTAGGTAAAGCATCTT 
CAGTAGAAAAAACTGAAGATAAAGAAAAAGTGTCTCCTCAAC 
CACAAGCCCGTATTACTAAAACTGGTAGACTGACTATTTCTAA 
CGAAACAACTACAGGTTTTGATATTTTAATTACGAATATTAAA 
GATGATAACGGTATCGCTGCTGTTAAGGTACCGGTTTGGACTG 
AACAAGGAGGGCAAGATGATATTAAATGGTATACAGCTGTAA 
CTACTGGGGATGGCAACTACAAAGTAGCTGTATCATTTGCTG 
ACCATAAGAATGAGAAGGGTCTTTATAATATTCATTTATACTA 
CCAAGAAGCTAGTGGGACACTTGTAGGTGTAACAGGAACTAA 
AGTGACAGTAGCTGGAACTAATTCTTCTCAAGAACCTATTGA 
AAATGGTTTACCAAAGACTGGTGTTTATAATATTATCGGAAGT 
ACTGAAGTAAAAAATGAAGCTAAAATATCAAGTCAGACCCAA 
TTTACTTTAGAAAAAGGTGACAAAATAAATTATGATCAAGTA 
TTGACAGCAGATGGTTACCAGTGGATTTCTTACAAATCTTATA 
GTGGTGTTCGTCGCTATATTCCTGTGAAAAAGCTAACTACAAG 
TAGTGAAAAAGCGAAAGATGAGGCGACTAAACCGACTAGTTA 
TCCCAACTTACCTAAAACAGGTACCTATACATTTACTAAAACT 
GTAGATGTGAAAAGTCAACCTAAAGTATCAAGTCCAGTGGAA 
TTTAATTTTCAAAAGGGTGAAAAAATACATTATGATCAAGTGT 
TAGTAGTAGATGGTCATCAGTGGATTTCATACAAGAGTTATTC 
CGGTATTCGTCGCTATATTGAAATTTAA 



MFMMKKGQVNDTKQSYSLRKYKFGLASVILGSFIMVTSPVFADQTTSVQVNN 
QTGTSVDANNSSNETSASSVITSNNDSVQASDKVVNSQNTATKDITTPLVETK 
PMVEKTLPEQGNYVYSKETEVKNTPSKSAPVAFYAKKGDKVFYDQVFNKDN
VKWISYKSFGGVRRYAAIESLDPSGGSETKAPTPVTNSGSNNQEKIATQGNYT
FSHKVEVKNEAKVASPTQFTLDKGDRIFYDQILTIEGNQWLSYKSFNGVRRFV 
LLGKASSVEKTEDKEKVSPQPQARITKTGRLTISNETTTGFDILITNIKDDNGIA 
AVKVPVWTEQGGQDDIKWYTAVTTGDGNYKVAVSFADHKNEKGLYNIHLY 
YQEASGTLVGVTGTKVTVAGTNSSQEPIENGLPKTGVYNIIGSTEVKNEAKISS
QTQFTLEKGDKINYDQVLTADGYQWISYKSYSGVRRYIPVKKLTTSSEKAKDE 
ATKPTSYPNLPKTGTYTFTKTVDVKSQPKVSSPVEFNFQKGEKIHYDQVLVVD
GHQWISYKSYSGIRRYIEI* 


Sequence description 

A) Length: 1642 bp - 547 aa (full length gene) 

B) Sequence Characteristics: 
Potential leader peptide sequence 

ORF is preceded by a potential Shine-Dalgarno sequence.

ID-66 


Clone 3-5 
(SEQ ID NO: [illegible])

ATGATATTGAGACGTCGAACTATTGTTTTATGGCAACTGGGTATCGCCATT 
TCTCTCATTCTTAGTATTCTAGCCTTAAATCTTTATTTCCATAGTACTCCCTT 
GCAAACCAATGCAGCTTTACGGAACCTTGCTCCTTCATTAAACCATCTTTTT
GGGACAGATGGTTTAGGTAGGGATATGTTTGTCAGAACGATTAAAGGACT 
TTATTTCTCTCTACAAGTCGGCTTATTAGGTGCCCTTATGGGGGTCATTCTG 
GCGACAGTTTTTGGAGTGCTTGCAGGTTTAGGAAATAGCATTATTGATAAA 
ATAATAGCATGGTTAGTTGATTTGTTTATTGGTATGCCTCATTTGATTTTTA 
TGATTCTCATTTCTTTTGTTGTTGGGAAAGGTGCTCAAGGGGTCATCATTGC
AACGGCTGTTACACATTGGCCTTCTTTAGCAAGGCTTATCCGCAATGAAGT 
CTATCATCTAAAGAATAAAGAATTTGTCCAACTTTCTAAAAGTATGGGAAA 
AACGCCTTATTATATTGTGAGGCATCATATCCTGCCTTTGATTGCTTCTCAA 
ATTTTCATTGGTTTTATCCTCTTATTTCCACATGTCATCCTACATGAAGCAT
CAATGACTTTCTTAGGATTTGGGCTCTCTGCCGAACAACCTTCGGTTGGTA
TCATTCTGTCAGAGGCAGCTAAGCATATCTCTCTTGGAAATTGGTGGTTGG 
TTATCTTTCCAGGACTTTATCTTATTTTGGTTGTCAATGCATTTGATACTAT 
CGGAGAATCTTTAAAGAAACTCTTTTACCCTCAAACTGATCATTTTTAG 



(SEQ ID NO: 4)

MILRRRTIVLWQLGIAISLILSILALNLYFHSTPLQTNAALRNLAPSLNHLFGTD 
GLGRDMFVRTIKGLYFSLQVGLLGALMGVILATVFGVLAGLGNSIIDKIIAWL 
VDLFIGMPHLIFMILISFVVGKGAQGVIIATAVTHWPSLARLIRNEVYHLKNKE 
FVQLSKSMGKTPYYIVRHHILPLIASQIFIGFILLFPHVILHEASMTFLGFGLSAE
QPSVGIILSEAAKHISLGNWWLVIFPGLYLILVVNAFDTIGESLKKLFYPQTDHF 



Sequence description

A) Length: 822 bp - 274 aa (full length gene)

B) Sequence Characteristics:
Potential leader peptide sequence
ORF is preceded by a potential Shine-Dalgarno sequence.
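
The length figures in a sequence description can be cross-checked against the sequence itself: 822 bp divides into 822 / 3 = 274 codons, presumably counting the stop codon. The sketch below is a minimal illustration, assuming the standard genetic code; the function names `translate` and `codon_count` are hypothetical and are not taken from the filing.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon; '*' marks a stop, 'X' an unreadable codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


def codon_count(dna: str) -> int:
    """Number of complete codons, ignoring whitespace in the input."""
    return len("".join(dna.split())) // 3


if __name__ == "__main__":
    # First 18 nucleotides of Clone 3-5 as printed in the figure.
    print(translate("ATGATATTGAGACGTCGA"))
    print(codon_count("ATGATATTGAGACGTCGA"))
```

Translating the first codons of a clone in this way and comparing them with the printed protein line is also a quick way to spot transcription errors in either sequence.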

ID-78 


Clone 3-5b 

ATGACAGAAACATTATTAAGCATTAAAGACCTCTCCATCACCTTCACTCAA 
TACGGAAGATTTTTAAAACCATTTCAATCAACACCGATACAAGCGCTGA 
ATTTAGAAATTAAAAAAGGTGAGTTATTAGCTATTATAGGTGCTAGTGGTT
CGGGGAAGAGTTTATTAGCACATGCTATTATGGATATTCTTCCTAAAAATG 
CATCTGTAACAGGAGATATGATTTATCGTGGTCAATCACTAAATTCTAAAC 
GCATTAAACAGTTGCGAGGAAAAGATATTACGTTGATTCCACAATCAGTTA 
ATTATTTAGATCCATCTATGAAAGTCAAACATCAGGTGCGCTTAGGTATCT 
CAGAAAATTCAAAGGCTACTCAAGAAGGATTGTTTCAACAGTTTGGTTTAA
AAGAAAGTGATGGTGACTTGGATCCTTTCCAACTTTCTGGCGGAATGCTCC 
GACGTGTTTTGTTTACAACGTGTATTAGTGATAAGGTTTCTTTGATTATTGC 
GGATGAGCCCACCCCTGGATTACATCCAGATGCTCTGCAAATGGTTTTAGA 
CCAACTACGCTCCTTTGCAGATAAAGGAATAAGCGTTATATTTATCACTCA 
TGATATTGTAGCAGCTAGTCAAATTGCTGATCGTATTACTATTTTTAAAGA
GGGAAAAGCTATTGAAACAGCTCCAGCTAGTTTCTTTAGCGGAAATGGAG 
AGCAGTTACAAACAGAATTTGCTAGAAGTTTATGGCGCTCTCTCCCACAGC 
AAGAATTTTTGAAAGGAGTTACTCATGACCTTAGAGGCTAA 

MTETLLSIKDLSITFTQYGRFLKPFQSTPIQALNLEIKKGELLAIIGASGSGKSLL
AHAIMDILPKNASVTGDMIYRGQSLNSKRIKQLRGKDITLIPQSVNYLDPSMK 
VKHQVRLGISENSKATQEGLFQQFGLKESDGDLDPFQLSGGMLRRVLFTTCIS 
DKVSLIIADEPTPGLHPDALQMVLDQLRSFADKGISVIFITHDIVAASQIADRITI 
FKEGKAIETAPASFFSGNGEQLQTEFARSLWRSLPQQEFLKGVTHDLRG* 



Sequence description

A) Length: 804 bp - 268 aa (full length gene)

B) Sequence Characteristics:

No obvious leader peptide sequence
ORF is preceded by a potential Shine-Dalgarno sequence.

This gene was not isolated using the LEEP
system. However, in determining a full length
gene sequence for ID-76, this gene was
identified downstream and fully sequenced.






ID-79 



Clone 3-5c 

(SEQ ID NO: [illegible])

GTCCATCTGGGGTGGTTCCCGATTGGTATTTCTTCTCCGATAGGTACTTTGA 

GTCAAGATATTACGTTAGCTGATCGTATTAAGCACCTTATTTTACCTGTTTT 

CACGGTAAGTATTCTAGGCATTGCCAATGTAACTCTTCATACTAGAACTAA 

AATGATGTCGGTACTTTCTAGTGAATATGTCTTATTTGCCAGAGCGCGTGG 

GGAAACGGAATGGCAAATTTTTAAAAATCATTGTCTTAGAAATGCTATCGT
ACCAGCTATTACACTGCATTTTTCCTATTTTGGAGAATTGTTTGGAGGATCC 
GTTCTTGCTGAGCAAGTTTTCTCATATCCAGGACTAGGGTCTACCCTAACT 
GAAGCAGGACTTAAAAGTGATACACCGCTACTTCTAGCTATTGTGATGATA 
GGGACATTATTTGTTTTTGCGGGCAATCTTATTGCGGATATTTTAAATAGC 
ATAATCAATCCACAGTTAAGGAGAAAAGTATGA

(SEQ ID NO: [illegible])

VHLGWFPIGISSPIGTLSQDITLADRIKHLILPVFTVSILGIANVTLHTRTKMMSV
LSSEYVLFARARGETEWQIFKNHCLRNAIVPAITLHFSYFGELFGGSVLAEQVF
SYPGLGSTLTEAGLKSDTPLLLAIVMIGTLFVFAGNLIADILNSIINPQLRRKV* 


Sequence description

A) Length: 495 bp - 165 aa (partial gene sequence)

B) Sequence Characteristics:

N-terminus has yet to be determined.

This gene was not isolated using the LEEP
system. However, in determining a full length
gene sequence for ID-76, this gene was
identified upstream.



ID-80 




Clone 2-17 

ttg^gga^aaVtacgttcaaacacaatgaaacgcgatcgtcaaaaagcga 
aggtagggcggtaatgcttaaaagattatttactgaagatggggaattga 
caaagattagtcgtcgtttcgtttggatgttagtggttatctattgtcttat
tattgtcaggatgtgttttgggcctcaaattatgattgagggggtatcaac 
tccgaatgttcagcgcttcggaagaattgtagctcttttagtaccatttaa 
ttcttttcgtagtttagatcagctaactagctttaaagagattttttgggtt 
attggtcaaaatgtagtgaatattttactgctgtttcctctcattatagggt 
tactatccctaaagccaagtttacggaaatataaaagcgttatattacttg
ctttcttgatgtctcttttcatagagtgtactcaagttgttttagatatttt 
aatagatgctaatcgggtttttgaaatcgacgatctatggacaaatacctt 
aggcggtcctttcgccctatggagttatcgaaacataaaaggttggcttct 
aactattagaaaatga 

(SEQ ID NO: 10)

MRTITFKHNETRSSKSEGRAVMLKRLFTEDGELTKISRRFVWMLVVIYCLIIVR
MCFGPQIMIEGVSTPNVQRFGRIVALLVPFNSFRSLDQLTSFKEIFWVIGQNVV
NILLLFPLIIGLLSLKPSLRKYKSVILLAFLMSLFIECTQVVLDILIDANRVFEIDD 
LWTNTLGGPFALWSYRNIKGWLLTIRK* 


Sequence description

A) Length: 579 bp - 193 aa (full length gene)
B) Sequence Characteristics:

Possesses a potential leader peptide sequence

No obvious Shine-Dalgarno, but the 'TTG' codon
may not be the actual translation start point.
A methionine (ATG) that occurs ~22 codons
downstream of the 'TTG' is preceded by a
potential Shine-Dalgarno sequence and may
represent the actual start codon.
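
Locating a candidate ATG start codon downstream of an alternative start codon such as TTG amounts to scanning the reading frame codon by codon. The sketch below is a minimal illustration only; the function name `in_frame_atg_codons` is hypothetical, and the example input is a made-up sequence rather than the clone shown above.

```python
from typing import List


def in_frame_atg_codons(orf: str) -> List[int]:
    """Return 0-based codon indices of in-frame ATG codons after the first codon.

    The first codon (index 0) is treated as the annotated start codon,
    for example TTG or GTG, and is therefore not reported.
    """
    seq = "".join(orf.split()).upper()
    hits = []
    for codon_index in range(1, len(seq) // 3):
        start = 3 * codon_index
        if seq[start:start + 3] == "ATG":
            hits.append(codon_index)
    return hits


if __name__ == "__main__":
    # Hypothetical reading frame: TTG AAA ATG CCC ATG TAA
    print(in_frame_atg_codons("TTGAAAATGCCCATGTAA"))
```

An ATG that is out of frame is deliberately ignored, since it could not serve as a start codon for the same reading frame.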

ID-81



Clone 3-1 



ttgaaaaatttaaatcgttatgtagttgcggtttctggagtcgttttacatt
taatgctaggatcaacttatgcttggagtgtgtttcgtaacccaattatct 
cagagactggttgggatatttcatcagtttcattcgcttttagtttggctat 
tttttgtctaggaatgtctgcagcttttatgggacacttagtagagcgtttt 
ggtcctaggataatgggaatgatttctgctattttatatggagcagggaat
gtgttaacaggcttagccattgaaactcagcagttatggttactgtatgtt 
gcatacggtattttaggaggaatcggacttggttcaggttatattactcca 
gtatcgactattattaaatggtttcctgataggaggggactagcaacagg 

ATTCGCTATTATGGGATTTGGCTTTGCTTCTTTAGTAACAAGTCCGCTTGCA 
CAATCCTTACTGATTAGGATTGGTGTGGGTAAAACGTTTTATATTTTGGGA
TTAGTATATTTTTTTGTCATGATGATTGCCTCACAATTTATTAAACAACCAC 
CTCAGGAAAAAATAAGTATTTTGACTCACGATGGTAAAAAGAATGCTATG 
AATTCACAAATTATCACTGGATTAAAAGCAAACGTCGCTATAAAATCAAA 
AACCTTTTACATCATTTGGTTGACCTTGTTTATTAATATTTCGTGTGGCTTA 
GGTTTAATATCAGCAGCTTCACCAATGGCACAAGATTTAGCAGGCTATTCC
GCAGAATCTGCAGCCTTATTAGTAGGGGTACTAGGGATATTTAACGGTTTT 
GGACGTCTGTTATGGGCAAGTCTCTCTGACTACATTGGACGCCCGTTGACC 
TTTATAATATTATTTATTGTGAACTTTATTATGACTTCTAGTTTATTTTTGTC 
ATTCAATGCTATTGTATTTGCAATAGCGATGTCTATTTTAATGACTTGTTAT 
GGTGCAGGTTTTTCCTTATTACCTGCTTATCTAAGTGATATTTTTGGAACAA
AGGAATTAGCTACTTTACATGGTTATAGTTTAACAGCATGGGCAATAGCAG 
GTCTGTTTGGGCCCCTATTGTTATCAAAGACATATTCATGGGGAAATTCCT 
ATCAATTGACATTAATGGTTTTTGGTTTTTTATTCTTATTCGGATTATTGTTA 
TCTCTATATTTAAGAAAATTAACAACTAAAGTTGTGTAG 

LKNLNRYVVAVSGVVLHLMLGSTYAWSVFRNPIISETGWDISSVSFAFSLAIFC 
LGMSAAFMGHLVERFGPRIMGMISAILYGAGNVLTGLAIETQQLWLLYVAYG 
ILGGIGLGSGYITPVSTIIKWFPDRRGLATGFAIMGFGFASLVTSPLAQSLLIRIG 
VGKTFYILGLVYFFVMMIASQFIKQPPQEKITILTHDGKKNAMNSQIITGLKAN
VAIKSKTFYIIWLTLFINISCGLGLISAASPMAQDLAGYSAESAALLVGVLGIFN
GFGRLLWASLSDYIGRPLTFIILFIVNFIMTSSLFLSFNAIVFAIAMSILMTCYGA
GFSLLPAYLSDIFGTKELATLHGYSLTAWAIAGLFGPLLLSKTYSWGNSYQLTL 
MVFGFLFLFGLLLSLYLRKLTTKVV* 


Sequence description:

A] Length 1221 bp - 407 aa (full length gene).

B] TTG start codon with Shine-Dalgarno
sequence upstream. Obvious signal peptide,
with hydropathy plot exhibiting many possible
membrane spanning regions, indicating protein
to be transmembrane.
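
A hydropathy plot of the kind mentioned above is typically a sliding-window average of per-residue hydrophobicity values along the protein. The filing does not say which scale or window was used, so the sketch below assumes the Kyte-Doolittle scale and a 19-residue window purely for illustration; the function name `hydropathy_profile` is hypothetical.

```python
from typing import List

# Kyte-Doolittle hydropathy values (an assumed choice of scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 19) -> List[float]:
    """Return the mean hydropathy of each window along the protein.

    Characters that are not one of the 20 standard residues (for example
    '*' or whitespace) are skipped before windowing.
    """
    residues = [aa for aa in protein.upper() if aa in KYTE_DOOLITTLE]
    if window < 1 or len(residues) < window:
        return []
    return [
        sum(KYTE_DOOLITTLE[aa] for aa in residues[i:i + window]) / window
        for i in range(len(residues) - window + 1)
    ]


if __name__ == "__main__":
    # First line of the ID-81 protein as printed in the figure.
    segment = "LKNLNRYVVAVSGVVLHLMLGSTYAWSVFRNPIISETGWDISSVSFAFSLAIFC"
    profile = hydropathy_profile(segment)
    print(len(profile), max(profile))
```

Stretches where the windowed average stays strongly positive are the usual candidates for membrane-spanning regions; any cut-off value would be a further assumption.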
ID-82



Clone 48 

ATGGCAGATAAAAACAGAACATTTAAACTTGTAGGTGCAGGATCTTCTAG
CACACAAGAAAAAATTGAAAAGCCTGCTCTTTCGTTTATGCAAGATGCGTG 
GCGTCGCTTGAAAAAAAACAAATTAGCAGTAGTTTCACTCTATTTATTAGC 
TCTTTTACTTACTTTTTCGTTAGCCTCAAATTTATTTGTAACTCAGAAGGAT 
GCTAATGGGTTTGATTCGAAAAAAGTAACGACATATCGCAACTTACCACCT 
AAATTGAGTTCAAACCTTCCTTTTTGGAATGGTAGCATTAATCCATCA

MADKNRTFKLVGAGSSSTQEKIEKPALSFMQDAWRRLKKNKLAVVSLYLLA 
LLLTFSLASNLFVTQKDANGFDSKKVTTYRNLPPKLSSNLPFWNGSINPS 


Sequence description:

A] Current length is 303 bp - 101 aa

B] No obvious signal peptide but Shine
Dalgarno sequence upstream of the ATG start
codon. Not identified directly using the LEEP system but was found
directly downstream of ID-34 described in WO 00/06736.

ID-83



Clone 98 

(SEQ ID NO: 15)

ATGAAAATAGTAGTACCAGTAATGCCTCGCAGTCTTGAAGAGGCTCAAGA 

AATAGATTTATCAAAATTTGATAGTGTTGATATTATTGAATGGCGAGCTGA 

TGCCTTACCAAAGGATGACATTATTAATGTAGCTCCAGCTATTTTTGAGAA 

ATTCGCAGGTCATGAAATTATTTTTACTTTTCGTACAACGCGTGAAGGTGG 

TAATATTGTCTTATCTGATGCTGAGTATGTTGAGTTAATCCAGAAAATTAA

TTCTATCTACAATCCAGATTATATTGATTTTGAGTATTTTTCACATAAAGAA 

GTTTTTCAAGAAATGCTAGAATTTCCAAATTTAGTCCTGTCTTATCACAATT 

TTCAAGAGACACCGGAGAATATTATGGAGATATTTTCAGAATTAACAGCC 

CTAGCACCACGAGTTGTGAAAATCGCAGTAATGCCAAAGAATGAACAAGA 



TGTCTTAGACGTTATGAATTACACTCGCGGTTTCAAGACTATTAATCCTGA 
TCAAGTTTATGCGACGGTATCTATGAGTAAAATTGGACGTATTTCTCGTTTT 
GCTGGTGATGTAACTGGATCTAGTTGGACATTTGCATATTTAGATTCATCT 
ATCGCACCCGGACAAATTACTATTTCAGAGATGAAGCGTGTCAAAGCATT 
GCTTGACGCTGACTGA

(SEQ ID NO: 16)

MKIVVPVMPRSLEEAQEIDLSKFDSVDIIEWRADALPKDDIINVAPAIFEKFAG
HEIIFTFRTTREGGNIVLSDAEYVELIQKINSIYNPDYIDFEYFSHKEVFQEMLEF 
PNLVLSYHNFQETPENIMEIFSELTALAPRVVKIAVMPKNEQDVLDVMNYTRG 
FKTINPDQVYATVSMSKIGRISRFAGDVTGSSWTFAYLDSSIAPGQITISEMKRV
KALLDAD* 






Sequence description: 



A] Length 678 bp, 225 aa (full length gene) 

B] No obvious signal peptide, but there is a 
Shine Dalgarno immediately upstream of ORF. 



ID-84 



Clone RS-52 


ATGAAAGACTTATTTGCAACAACAGAAGCATCATCAAGGAAACAGGAACA 
AGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAACAAATGGTA 
ATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAAAAAATGAGATGACA 
GGTACATGGGGAATTTCTACTAAAATTAATGAACAATTTTCGATTAGTTTT
TCTGAAGATAGAATTGGTGGTAAACTTAGAGCATTAGGATATCAACCGAA 
TGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATCAAAATGTTAATGA 
TATTGAAGTGATTTATATGAAGAAAGAATAG 

MKDLFATTEASSRKQEQDRIVNYIKQHVELTNGNQIKKIEFIDFQKNEMTGTW
GISTKINEQFSISFSEDRIGGKLRALGYQPNEIGFSKDINSNNQNVNDIEVIYMK
KE* 

Sequence description: 

A] Length: 333 bp - 111 aa (partial sequence)

B] No obvious Shine Dalgarno sequence upstream 
of the ATG start codon, and no obvious signal 
peptide within the protein. 



ID-85 




Clone RS-53 

(SEQ ID NO: [illegible])

ATGAAAAAACGTATATGGTATTTGATAATAATAATCACAGTAATTTTAGGA
GGACTAGCCATGAAAAACTTATTTGCAACAACAGAAGCATCATCAAGGAA 
ACAGGAACAAGATAGAATTGTCAATTACATAAAACAACATGTTGAGTTAA 
CAAATGGTAATCAAATAAAAAAAATTGAGTTTATCGACTTTCAAAAAAAT 
GAGATGACAGGTACATGGGGAATTTCTACTAAAATTAATGAACAATTTTCG 
ATTAGTTTTTCTGAAGATAGAATTGGTGGTAAACTTAGAGCATTAGGATAT
CAACCGAATGAAATAGGTTTTTCAAAGGACATCAATAGTAATAATCA 

(SEQ ID NO: 20)

MKKRIWYLIIIITVILGGLAMKNLFATTEASSRKQEQDRIVNYIKQHVELTNGN 
QIKKIEFIDFQKNEMTGTWGISTKINEQFSISFSEDRIGGKLRALGYQPNEIGFSK
DINSNNQ



Sequence description: 

A] Length: 351 bp - 117 aa (Partial sequence)

B] Obvious signal peptide and Shine Dalgarno 
sequence upstream of the ATG start codon. 


ID-86 

Clone ID-74 

ATGTCAAATCAATATGATTATATCGTTATTGGTGGAGGTAGT
GCAGGCAGTGGTACCGCTAATAGGGCAGCCATGTATGGAGC 
AAAAGTCCTGTTAATTGAAGGTGGACAAGTAGGTGGAACTTG 
TGTTAACTTAGGTTGTGTACCTAAGAAAATCATGTGGTATGG 
TGCACAAGTTTCTGAGACACTCCATAAGTATAGTTCAGGTTA 
TGGTTTTGAAGCCAATAATCTTAGTTTTGATTTTACTACTCTA
AAAGCTAATCGCGATGCTTACGTGCAGCGGTCTAGACAGTCG 
TATGCCGCTAATTTTGAGCGTAATGGGGTCGAAAAGATTGAT 
GGATTTGCTCGTTTTATTGATAACCATACTATTGAAGTGAATG 
GTCAGCAATATAAAGCTCCTCACATTACTATTGCAACAGGTG 



GACACCCTCTTTACCCTGATATTATTGGAAGTGAACTTGGTG 
AGACTTCTGATGATTTTTTTGGATGGGAGACCTTACCAAATTC 
TATATTGATTGTTGGGGCGGGCTATATCGCGGCAGAACTTGC 
TGGAGTGGTTAATGAATTAGGCGTTGAAACCCATCTTGCATT 
TAGAAAAGACCATATTCTACGCGGATTTGATGACATGGTAAC
AAGTGAGGTTATGGCTGAAATGGAGAAATCAGGTATCTCTTT 
ACATGCTAACCATGTACCTAAATCTCTTAAACGCGATGAAGG 
TGGCAAGTTGATTTTTGAAGCTGAAAATGGGAAAACGCTTGT 
CGTTGATCGTGTAATATGGGCTATCGGCCGTGGACCAAATGT 
AGACATGGGACTTGAAAATACCGATATTGTTTTAAATGATAA
AGATTATATCAAAACAGATGAATTTGAGAATACTTCTGTAGA 
TGGCGTGTATGCTATTGGAGATGTTAATGGGAAAATTGCCTT 
GACACCGGTAGCAATTGCAGCAGGTCGTCGCTTATCAGAAAG 
ACTTTTTAATCATAAAGATAACGAAAAATTAGATTACCATAA 
TGTACCTTCAGTTATTTTTACTCACCCTGTAATTGGGACGGTA
GGACTTTCAGAAGCAGCAGCTATCGAGCAATTTGGAAAAGAT 
AATATCAAAGTCTATACATCAACTTTTACCTCTATGTATACGG 
CTGTTACCAGTAATCGCCAAGCAGTTAAGATGAAGCTCATAA 
CCCTAGGAAAAGAGGAAAAAGTTATTGGGCTTCATGGTGTTG 
GTTATGGTATTGATGAAATGATTCAAGGTTTTTCAGTTGCTAT
CAAAATGGGGGCTACTAAAGCAGACTTTGATGATACTGTTGC 
TATTCACCCAACTGGATCTGAGGAATTTGTTACAATGCGCTA 
A 

MSNQYDYIVIGGGSAGSGTANRAAMYGAKVLLIEGGQVGGTC
VNLGCVPKKIMWYGAQVSETLHKYSSGYGFEANNLSFDFTTLK 
ANRDAYVQRSRQSYAANFERNGVEKIDGFARFIDNHTIEVNGQ
QYKAPHITIATGGHPLYPDIIGSELGETSDDFFGWETLPNSILIVG 
AGYIAAELAGVVNELGVETHLAFRKDHILRGFDDMVTSEVMAE 
MEKSGISLHANHVPKSLKRDEGGKLIFEAENGKTLVVDRVIWAI
GRGPNVDMGLENTDIVLNDKDYIKTDEFENTSVDGVYAIGDVN 
GKIALTPVAIAAGRRLSERLFNHKDNEKLDYHNVPSVIFTHPVIG 
TVGLSEAAAIEQFGKDNIKVYTSTFTSMYTAVTSNRQAVKMKLI
TLGKEEKVIGLHGVGYGIDEMIQGFSVAIKMGATKADFDDTVAI 
HPTGSEEFVTMR*



ID-87



Clone RS-55 

ATGACAAAAAAACATCTTAAAACGCTTGCCTTGGCACTTACTACAGTATCA 
GTAGTGACATACAGCCAGGAGGTATATGGATTAGAAAGAGAGGAATCGGT
CAAACAAGAACAAACCCAGTCAGCTTCAGAAGATGATTGGTTCGAAGAAG 
ATAATGAGAGGAAAACAAATGTTTCTAAAGAGAATTCTACTGTTGATGAA 
ACAGTTAGTGATTTATTTTCTGATGGAAATAGTAATAACTCTAGTTCTAAA 
ACCGAGTCAGTGGTAAGTGACCCTAAACAAGTCCCCAAAGCAAAACCAGA 
GGTTACACAAGAAGCAAGCAATTCTAGTAATGATGCTAGCAAAGTAGAAG
TACCAAAACAGGATACAGCTTCAAAAAAGGAAACTCTAGAAACATCAACT 
TGGGAGGCAAAAGATTTCGTAACTAGAGGGGATACTTTAGTAGGTTTTTCA 
AAATCTGGAATTAATAAGTTATCTCAAACATCACACTTGGTTTTACCAAGT 
CATGCAGCAGATGGAACTCAATTGACACAAGTAGCTAGCTTTGCTTTTACT 
CCAGATAAAAAGACGGCCATTGCAGAATATACAAGTAGGCTAGGAGAAA
ATGGGAAACCGAGTCGTTTAGATATTGATCAGAAGGAAATTATTGATGAG 
GGAGAAATATTTAATGCTTACCAGTTGACTAAGCTTACTATTCCAAATGGT 
TATAAGTCTATTGGTCAAGATGCTTTTGTGGACAATAAGAATATTGCTGAG 
GTTAACCTTCCTGAGAGTCTCGAGACTATTTCAGACTATGCTTTTGCTCACA 
TGTCTTTAAAACAAGTAAAGTTACCAGATAACCTAAAGGTCATTGGAGAA
TTAGCTTTTTTTGATAATCAGATTGGTGGTAAGCTTTACTTGCCACGTCACT 
TGATAAAATTAGCAGAACGCGCTTTCAAATCTAATCGTATTCAAACAGTTG 
AATTTTTGGGAAGTAAGCTTAAGGTTATAGGAGAAGCAAGTTTTCAAGAT 
AATAATCTGAGGAATGTTATGCTTCCGGATGGACTTGAAAAAATAGAATC 
AGAAGCTTTTACAGGAAATCCAGGAGATGAACATTACAACAATCAGGTTG
TATTGCGCACAAGGACAGGCCAAAATCCACATCAACTTGCGACTGAGAAT 
ACTTACGTCAATCCGGACAAATCATTGTGGCGTGCAACACCTGATATGGAT 
TATACCAAATGGTTAGAGGAAGATTTTACCTATCAAAAAAATAGTGTTACA 
GGTTTTTCAAATAAAGGCTTACAAAAGGTAAGACGTAATAAAAACTTAGA 
AATTCCAAAACAACACAATGGTATTACTATTACTGAAATTGGTGATAACGC
TTTTCGCAATGTTGATTTTCAAAGTAAAACTTTACGTAAATATGATTTGGA 
AGAAATAAAGCTCCCCTCAACTATTCGGAAAATAGGTGCTTTTGCTTTTCA 
ATCTAATAACTTGAAATCCTTTGAAGCAAGTGAAGATTTAGAAGAGATTA 
AAGAGGGAGCCTTTATGAATAATCGTATTGGAACTCTAGACTTGAAAGAC 
AAACTTATCAAAATAGGTGATGCTGCTTTCCATATTAATCATATTTATGCC
ATTGTTCTTCCAGAATCTGTACAAGAAATAGGACGTTCAGCTTTTCGACAA 
AATGGTGCGCTTCACCTTATGTTTATCGGAAATAAGGTTAAAACAATTGGT 
GAAATGGCTTTTTTATCCAATAAACTGGAAAGTGTAAATCTCTCTGAGCAA 
AAACAATTAAAGACAATTGAGGTCCAAGCTTTTTCGGATAATGCCCTTAGT 
GAAGTAGTCTTACCGCCAAATTTACAGACTATTCGTGAAGAGGCTTTCAAA
AGGAATCATTTGAAAGAAGTGAAGGGTTCATCTACATTATCTCAGATTACT 
TTTAATGCTTTTGATCAAAATGATGGGGACAAACGCTTTGGTAAGAAAGTG 
GTTGTTAGGACACATAATAATTCTCATATGTTAGCAGATGGTGAGCGTTTT 
ATCATTGATCCAGATAAGCTATCTTCTACAATGGTAGACCTTGAAAAGGTT 



TTAAAAATAATCGAAGGTTTAGATTACTCTACATTACGTCAGACTACTCAA 

ACTCAGTTTAGAGAAATGACTACTGCAGGTAAAGCGTTGTTATCAAAATCT 

AACCTCCGACAAGGAGAAAAACAAAAATTCCTTCAAGAAGCACAATTTTT 

CCTTGGTCGCGTTGATTTGGATAAAGCCATAGCTAAAGCTGAGAAGGCTTT 

AGTGACCAAGAAGGCAACAAAGAATGGTCATTTGCTTGAGAGGAGTATTA 

ACAAAGCGGTATTAGCTTATAATAATAGTGCTATTAAAAAAGCTAATGTTA 

AGCGCTTGGAAAAAGAGTTAGACTTGCTGACAGATTTAGTCGAGGGAAAA 

GGACCATTAGCGCAAGCTACAATGGTACAAGGAGTTTATTTATTAAAGAC 

GCCTTTACCATTGCCAGAATATTATATCGGATTGAACGTTTATTTTGACAA 

GTCTGGAAAATTGATTTATGCACTTGATATGAGTGATACTATTGGCGAGGG 

ACAAAAAGATGCATATGGTAATCCTATATTAAATGTTGACGAGGATAATG 

AAGGTTATCATACCTTGGCAGTTGCCACTTTAGCTGATTATGAAGGTCTTT 

ATATTAAAGATATTTTAAATAGTTCCCTTGATAAGATTAAAGCAATACGCC 

AGATTCCTTTGGCAAAATATCATAGATTAGGAATTTTCCAAGCTATCCGAA 

ATGCAGCGGCAGAAGCAGACCGATTGCTTCCTAAGACACCTAAGGGGTAC 

CTAAATGAAGTCCCAAATTATCGTAAAAAACAAATGGAGAAAAATTTAAA 

ACCAGTTGATTATAAAACGCCGATTTTTAATAAGGCTTTACCTAATGAAAA 

GGTAGACGGTGATAGAGCGGCTAAAGGTCATAATATAAATGCGGAGACTA 

ATAATTCTGTAGCTGTAACACCAATAAGGTCCGAGCAGCAATTACATAAGT 

CACAGTCTGATGTAAATTTACCTCAAACAAGTTCTAAAAATAATTTTATAT 

ACGAGATTCTAGGATACGTTAGTTTATGTTTGCTTTTCCTAGTAACTGCTGG 

GAAAAAAGGAAAACGAGCAAGAAAATAA 

(SEQ ID NO: [illegible])

MTKKHLKTLALALTTVSVVTYSQEVYGLEREESVKQEQTQSASEDDWFEEDN 

ERKTNVSKENSTVDETVSDLFSDGNSNNSSSKTESVVSDPKQVPKAKPEVTQE 

ASNSSNDASKVEVPKQDTASKKETLETSTWEAKDFVTRGDTLVGFSKSGINKL 

SQTSHLVLPSHAADGTQLTQVASFAFTPDKKTAIAEYTSRLGENGKPSRLDIDQ 

KEIIDEGEIFNAYQLTKLTIPNGYKSIGQDAFVDNKNIAEVNLPESLETISDYAF 

AHMSLKQVKLPDNLKVIGELAFFDNQIGGKLYLPRHLIKLAERAFKSNRIQTV 

EFLGSKLKVIGEASFQDNNLRNVMLPDGLEKIESEAFTGNPGDEHYNNQVVLR

TRTGQNPHQLATENTYVNPDKSLWRATPDMDYTKWLEEDFTYQKNSVTGFS 

NKGLQKVRRNKNLEIPKQHNGITITEIGDNAFRNVDFQSKTLRKYDLEEIKLPS 

TIRKIGAFAFQSNNLKSFEASEDLEEIKEGAFMNNRIGTLDLKDKLIKIGDAAFH 

INHIYAIVLPESVQEIGRSAFRQNGALHLMFIGNKVKTIGEMAFLSNKLESVNL 

SEQKQLKTIEVQAFSDNALSEVVLPPNLQTIREEAFKRNHLKEVKGSSTLSQITF

NAFDQNDGDKRFGKKVVVRTHNNSHMLADGERFIIDPDKLSSTMVDLEKVL 

KIIEGLDYSTLRQTTQTQFREMTTAGKALLSKSNLRQGEKQKFLQEAQFFLGR 

VDLDKAIAKAEKALVTKKATKNGHLLERSINKAVLAYNNSAIKKANVKRLEK 

ELDLLTDLVEGKGPLAQATMVQGVYLLKTPLPLPEYYIGLNVYFDKSGKLIYA

LDMSDTIGEGQKDAYGNPILNVDEDNEGYHTLAVATLADYEGLYIKDILNSSL 

DKIKAIRQIPLAKYHRLGIFQAIRNAAAEADRLLPKTPKGYLNEVPNYRKKQM 

EKNLKPVDYKTPIFNKALPNEKVDGDRAAKGHNINAETNNSVAVTPIRSEQQL 

HKSQSDVNLPQTSSKNNFIYEILGYVSLCLLFLVTAGKKGKRARK*



Sequence description: 



A] Length 3168 bp - 1056 aa (Partial sequence)

B] Obvious signal peptide with Shine Dalgarno 
sequence upstream of the ATG start codon. 


ID-88 



Clone RS-56 


(SEQ ID NO: [illegible])

GCAGGATACATCATGCACAAGCACGAGGCTATCGTGTCATGCTGGGGTCA 
ACCCAGGAAGACATGTCGGCACAAGCTGAAGATTTCTTTACAGTCTGTACA 
CAATAAAGAGACGGGTAAGAGCGCTTTTAATGACAAAGAACGACTAGCAA 
TT

AGYIMHKHEAIVSCWGQPRKTCRHKLKISLQSVHNKETGKSAFNDKERLAI



Sequence description:

A] Length: 153 bp - 51 aa (partial sequence)

B] No signal peptide visible, insufficient
sequence data to determine the presence of a
Shine Dalgarno sequence.



ID-89 


Clone RS-58 
(SEQ ID NO: [illegible])

GTGTCATTTATGCAAAGAAAATCCTATTTAAAATCCATGAGTGTTCTTACT
TTAACAGCTTGTCTTATATCAGGATATGTGGTTAAAGATATTGCTATGTTA
CATGCAGTATCTGCCAGTGAGAAGAAAGCAAATAATGTCAGTCCGAGAGA 
AAATCTCTACAGGGCTGTCAATGATAATTGGCTAGCCAATACAAAACTCA 
AACAAGGGCAGACTAGTGTTAATAGTTTTTCAGAAATTGAGGATAAATTA 
AAGCAACTGTTAGTGTCTGATATGGCTAAAATGGCCTCAGGAAAGATTGA 



AACAACCAATGATGAACAGAAAAAAATGGTTGCATACTATAAACAAGGTA 
TGGACTTTAAAACAAGAGATAAAAATGGTCTCAAACCTCTAAAACCAGTT 
TTACAAAAACTTGAAGCAGTCTCTTCAATGAAAGACTTTCAAAGTTTGGCC 
CATGATTTTGTGATGAGTGGTTTTGTTTTACCATTTGGTTTGACTGTGGAAA 
CCAATGCTCGAGATAATAGCCAAAAGCAATTGGTGCTTCGTCAAGCACCC
GCATTACTTGAATCACCTGACCAATATAAGAAGGGCAATAAAGAAGGTGA 
GGCTAAATTATCAGCTTACCGTACTTCAGCAATGGCTTTGCTTAAACAAGC 
TGGAAAAAGTAACATTGAAGATAGAAAACTAGTTAAACAAGCTATAGCAT 
TTGATAGACTCTTATCAGAAAAAACGCAAGTTGATCAAAGTAAAATCACA 
GCTGAAAGTGAGACAGCTGCGGGGCGATATAACCCTGAAAGTATGGAAAC
GGTTCACAATTACGCCAAGGAATTTGACTTTAAAGAATTGATTGAAAAACT 
AGTTGGGCCAACGAATAAGGCAGTCAATGTAGAAGATAAAACTTATTTTA 
AACAGGTTAATGATGTTATAAATAGTAAACAATTAGCCAATATGAAAGCA 
TGGATGATGATTTCTATGCTAGTTGATCAATCAGATTTTCTAGGAGAACAA 
AATCGTCAAGCAGCGAGTGCTTTTAAGAATGTTGCGTCTGGTTTGACTCAG
ATTGAATCGAAAGAAAAAATGCTTACACCCAATTAG 



(SEQ ID NO: [illegible])

MSFMQRKSYLKSMSVLTLTACLISGYVVKDIAMLHAVSASEKKANNVSPREN
LYRAVNDNWLANTKLKQGQTSVNSFSEIEDKLKQLLVSDMAKMASGKIETTN 
DEQKKMVAYYKQGMDFKTRDKNGLKPLKPVLQKLEAVSSMKDFQSLAHDF 
VMSGFVLPFGLTVETNARDNSQKQLVLRQAPALLESPDQYKKGNKEGEAKLS 
AYRTSAMALLKQAGKSNIEDRKLVKQAIAFDRLLSEKTQVDQSKITAESETAA
GRYNPESMETVHNYAKEFDFKELIEKLVGPTNKAVNVEDKTYFKQVNDVINS
KQLANMKAWMMISMLVDQSDFLGEQNRQAASAFKNVASGLTQIESKEKMLT
PN* 



Sequence description:

A] Length: 1095 bp - 365 aa (full length gene)

B] A GTG start codon (possible ATG start codon
located 7 bp further downstream) with an obvious
signal peptide. Shine Dalgarno sequence present
upstream of the ORF.



ID-90 




Clone RS-59 



ATGGAAATGCCTAAAAGAAATGAATTACTCAATAAAGAAATTAAAATGAG 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CTACTTTTTATTTGGTAGTACTTATACTTGTTACTGTAGCAGTTATATTGTC 
GTTATTTAAATATTTTTTATAG 

(SEQ ID NO: 30)

MEMPKRNELLNKEIKMSIDKLRYKEPESEHDKRPTFYLVVLILVTVAVILSLFK 
YFL* 



Sequence description:

A] Length: 174 bp - 58 aa (full length gene)

B] No obvious signal peptide, but Shine
Dalgarno sequence is present upstream of ATG
start codon.



ID-91



Clone RS-62 (partial sequence)
(SEQ ID NO: 31)

ATGCAGGTATTTTTAAATATTGTCAATAAATTCTTTGATCCAGTTATTCATA
TGGGTTCGGGAGTTGTGATGCTAATTGTCATGACAGGTTTAGCCATGATAT
TTGGAGTGAAGTTTTCTAAAGCACTTGAAGGTGGTAT

MQVFLNIVNKFFDPVIHMGSGVVMLIVMTGLAMIFGVKFSKALEGG



Sequence description:

A] Length: 141 bp - 41 aa (partial sequence)

B] Shine Dalgarno sequence present upstream of
ATG start codon with a possible signal peptide
present.



ID-92 



Clone RS-69 (partial sequence) 
(SEQ ID NO: [illegible])

ATGAAAAAGAAAACATTCAGTGCTTATAACTTTTTAACGGCTCTTATCCTT 
TGTCTTTTGACAGTGCTTTTTATCTTTCCATTTTATTGGATTATGACAGGAG 
CTTTTAA 

(SEQ ID NO: [illegible])

MKKKTFSAYNFLTALILCLLTVLFIFPFYWIMTGAF



Sequence description:

A] Length: 110 bp - 36 aa (Partial sequence)

B] Possible signal peptide with Shine Dalgarno
sequence directly upstream of the ATG start
codon.



ID-93



Clone RS-70

(SEQ ID NO: 35)

ATGACTGAGAACTGGTTACATACTAAAGATGGTTCAGATATTTATTATCGT

GTCGTTGGTCAAGGTCAACCGATTGTTTTTTTACATGGCAATAGCTTAAGT

AGTCGCTATTTTGATAAGCAAATAGCATATTTTTCTAAGTATTACCAAGTT

ATTGTTATGGATAGTAGAGGGCATGGCAAAAGTCATGCAAAGCTAAATAC

CATTAGTTTCAGGCAAATAGCAGTTGACTTAAAGGATATCTTAGTTCATTT

AGAGATTGATAAAGTTATATTGGTAGGCCATAGCGATGGTGCTAATTTAGC

TTTAGTTTTTCAAACGATGTTTCCAGATATGGTTAGAGGGCTTTTGCTTAAT

TCAGGGAACCTGACTATTCATGGTCAGCGATGGTGGGATATTCTTTTAGTA

AGGATTGCCTATAAATTCCTTCACTATTTAGGGAAACTCTTTCCGTATATG

AGGCAAAAAGCTCAAGTTATTTCGCTTATGTTGGAGGATTTGAAGATTAGT

CCAGCTGATTTACAGCATGTGTCAACTCCTGTAATGGTTTTGGTTGGAAAT

AAGGACATAATTAAGTTAAATCATTCTAAGAAACTTGCTTCTTATTTTCCA

AGGGGGGAGTTTTATTCTTTAGTTGGCTTTGGGCATCACATTATTAAGCAA

GATTCCCATGTTTTTAATATTATTGCAAAAAAGTTTATCAACGATACGTTG

AAAGGAGAAATTGTTGAAAAAGCTAATTGA

MTENWLHTKDGSDIYYRVVGQGQPIVFLHGNSLSSRYFDKQIAYFSKYYQVIV
MDSRGHGKSHAKLNTISFRQIAVDLKDILVHLEIDKVILVGHSDGANLALVFQ
TMFPDMVRGLLLNSGNLTIHGQRWWDILLVRIAYKFLHYLGKLFPYMRQKA
QVISLMLEDLKISPADLQHVSTPVMVLVGNKDIIKLNHSKKLASYFPRGEFYSL
VGFGHHIIKQDSHVFNIIAKKFINDTLKGEIVEKAN*



Sequence description: 

A] Length: 744 bp - 248 aa (full length gene) 

B] No obvious signal peptide, but Shine 
Dalgarno sequence upstream of the ATG start 
codon. 



ID-94 



Clone RS-71 

(SEQ ID NO: [illegible])

ATGGTAGCAAAAGAGTTAGGTAAAAATAGCTTTACTATCCCAACTATTTGT 

TCTAATTGCTCCGCAGGTACTGCCATTGCAGTTGTATATAATGATGACCAT 

TCTTTCTTAAGATACGGCTATCCCGAGTCTCCACTTCATATTTTTATCAATA 

CACGGATCATTGCACAGGCACCAAGCAAATATTTTTGGGCTGGTATTGGGG 

ACGGTATTTCAAAAGCCCCTGAAGTAGAACGTGCTACCTTAGAGGCTAAG 

ACCAATAAACTACCACATACTGCAGTGTTAGGACAAGCAGTCGCTCTGTCT 

TCAAAGGAAGCTTTTTATCAATTTGGTGAACAAGGTCTAAAAGACGTTGAA 

GCTAATTTAGCTTCGCGTGCAGTTGAAGAAATTGCGCTTGATATCTTA 

(SEQ ID NO: [illegible])

MVAKELGKNSFTIPTICSNCSAGTAIAVVYNDDHSFLRYGYPESPLHIFINTRIIA 
QAPSKYFWAGIGDGISKAPEVERATLEAKTNKLPHTAVLGQAVALSSKEAFY 
QFGEQGLKDVEANLASRAVEEIALDIL

Sequence description: 

A] Length: 405 bp - 135 aa (Partial sequence) 

B] No obvious Shine Dalgarno sequence upstream
of the ATG start codon, probable signal
peptide present at the N-terminus.



ID-95 



Clone RS-73 



TTGAGGGAAACTTACTGGAAAATTTCAAGCGATTGCGATAAAATAAATCTT
GCAGAGTTTTCTAGAGAAAGGAGGTCAGATTTATTGGAGTGGCAAGATCT 
AGCGCAGTTACCTGTATCTATTTTTAAAGACTATGTTACAGATGCTCAAGA 
CGCGGAAAAACCTTTTATATGGACAGAAGTATTTTTAAGGGAGATTAATCG 
CTCAAATCAAGAAATTATTTTGCATATTTGGCCGATGACTAAGACAGTCAT 
TCTGGGGATGTTAGATCGAGAATTACCACATTTAGAATTAGCTAAAAAAG
AAATCATCAGTCGTGGTTATGAACCAGTTGTTCGGAATTTTGGAGGTCTCG 
CAGTTGTAGCTGATGAAGGAATTTTAAATTTTTCATTGGTTATTCCAGATGT 
TTTTGAGAGAAAATTGTCTATCTCAGATGGGTATCTTATAATGGTCGATTTT 
ATTAGAAGTATATTTTCGGATTTTTATCAACCTATTGAGCACTTTGAAGTA 
GAGACCTCCTATTGTCCTGGTAAGTTTGATCTTAGTATAAATGGCAAAAAA
TTTGCTGGCTTGGCTCAGCGCCGTATAAAGAATGGTATTGCGGTATCAATT 
TACCTTAGCGTTTGTGGCGATCAAAAAGGGCGGAGTCAAATGATTTCAGAT 
TTTTATAAGATTGGTCTAGGTGATACGGGTAGTCCAATTGCTTATCCAAAT 
GTAGATCCTGAAATTATGGCTAATCTATCTGATCTATTAGATTGTCCTATG 
ACAGTAGAAGATGTTATTGATCGTATGTTGATTAGCCTTAAACAAGTAGGT
TTTAATGATCGTTTACTGATGATTAGACCCGATTTAGTTGCAGAGTTTGAT 
AGATTTCAGGCTAAGTCTATGGCTAATAAGGGGATGGTGAGCAGAGATGA 
ATAA 

(SEQ ID NO: 40)

MRETYWKISSDCDKINLAEFSRERRSDLLEWQDLAQLPVSIFKDYVTDAQDAE
KPFIWTEVFLREINRSNQEIILHIWPMTKTVILGMLDRELPHLELAKKEIISRGYE
PVVRNFGGLAVVADEGILNFSLVIPDVFERKLSISDGYLIMVDFIRSIFSDFYQPI 
EHFEVETSYCPGKFDLSINGKKFAGLAQRRIKNGIAVSIYLSVCGDQKGRSQMI 
SDFYKIGLGDTGSPIAYPNVDPEIMANLSDLLDCPMTVEDVIDRMLISLKQVGF
NDRLLMIRPDLVAEFDRFQAKSMANKGMVSRDE* 



Sequence description:

A] Length: 921 bp - 307 aa (Full-length gene sequence)

B] No obvious Shine Dalgarno sequence upstream
of the TTG start codon or signal peptide
visible. Actual start point may be a further
85 bp downstream (TTG). This start point is
preceded by a typical Shine-Dalgarno sequence.



ID-96 



Clone RS-74 

(SEQ ID NO: 41)

TTGGAAGGTTTACTTATTGCATTGATTCCCATGTTTGCGTGGGAAAGTATT 

GGATTTGTTAGTAATAAAATTGGAGGGCGTCCAAATCAACAAACATTTGG 

AATGACTTTAGGAGCATTGCTATTTGCGATTATCGTATGGTTATTTAAACA 

GCCAGAGATGACTGCCTCATTGTGGATTTTTGGTATCTTAGGTGGTATCCT 

ATGGTCAGTCGGCCAAAATGGTCAATTTCAAGCAATGAAATATATGGGAG 

TCTCTGTTGCTAATCCACTGTCAAGTGGTGCACAATTAGTAGGTGGAAGCC 

TAGTTGGTGCTTTAGTCTTTCATGAATGGACTAAGCCAATCCAATTTATTTT 

AGGATTGACAGCGTTGACATTATTAGTTATCGGCTTCTATTTCTCAAGTAA 

ACGTGATGTTTCAGAACAAGCTTTGGCAACACATCAAGAGTTTTCAAAAG 

GATTTGCTACAATTGCTTATTCAACTGTAGGTTACATCTCGTACGCAGTTTT 

ATTTAACAACATTATGAAGTTCGACGCTATGGCCGTCATTTTACCCATGGC 

TGTTGGAATGTGTCTAGGTGCAATTTGTTTCATGAAGTTTCGTGTTAACTTT 

GAGGCTGTTGTTGTTAAAAATATGATTACAGGTCTCATGTGGGGCGTTGGT 

AATGTCTTCATGTTATTGGCAGCAGCTAAAGCAGGGCTAGCAATTGCTTTT 

AGTTTTTCTCAACTTGGAGTAATTATCTCTATTATTGGTGGTATTTTATTTTT 

AGGTGAGACAAAAACGAAGAAAGAGCAGAAATGGGTTGTCATGGGTATC 

CTTTGTTTTGTTATGGGTGCTATATTACTTGGTATTGTTAAATCTTATTAA 

(SEQ ID NO: [illegible])

MEGLLIALIPMFAWESIGFVSNKIGGRPNQQTFGMTLGALLFAIIVWLFKQPEM

TASLWIFGILGGILWSVGQNGQFQAMKYMGVSVANPLSSGAQLVGGSLVGAL 

VFHEWTKPIQFILGLTALTLLVIGFYFSSKRDVSEQALATHQEFSKGFATIAYST 

VGYISYAVLFNNIMKFDAMAVILPMAVGMCLGAICFMKFRVNFEAVVVKNMI

TGLMWGVGNVFMLLAAAKAGLAIAFSFSQLGVIISIIGGILFLGETKTKKEQK 

WVVMGILCFVMGAILLGIVKSY* 



Sequence description: 

A] Length: 867 bp - 289 aa (full-length gene) 

B] Possible Shine Dalgarno sequence upstream of
GTG start codon, no obvious signal peptide 
present. 



ID-97 



Clone RS-75 

(SEQ ID NO: [illegible])

ATGACAACTTACTACGAAGCTATAAACTGGAACGAAATTGAAGATGTTAT 
TGATAAATCAACTTGGGAAAAACTAACCGAACAATTTTGGCTCGATACAC
GTATCCCTTTATCAAATGACTTAGACGATTGGCGCAAACTTTCCGCTCAAG 
AAAAAGATCTTGTTGGCAAGGTTTTTGGAGGCTTAACCCTACTTGATACCA 
TGCAATCAGAAACTGGTGTTGAAGCTATTCGTGCCGATGTTCGCACGCCTC 
ACGAAGAAGCTGTCTTAAACAATATTCAATTCATGGAATCTGTTCACGCTA 
AATCTTATTCTTCAATTTTCTCAACTTTAAATACTAAATCAGAAATTGAAG
AAATTTTCGAGTGGACTAATAATAATGAGTTCCTTCAAGAAAAAGCACGT 
ATTATCAATGACATTTATGCTAATGGAAATGCCCTTCAAAAAAAGGTGGCT 
TCCACCTACCTCGAAACTTTCCTTTTTTATTCTGGCTTTTTCACACCTCTTTA 
CTATTTGGGAAATAATAAGTTAGCAAATGTTGCTGAAATCATTAAATTAAT 
TATTCGTGATGAATCTGTACATGGTACTTATATCGGTTACAAATTCCAGCTT
GGTTTTAACGAATTACCAGAAGATGAGCAAGAGAATTTTCGTGATTGGAT 
GTATGACCTCCTTTATCAGCTGTATGAAAACGAAGAAAAATACACCAAGA 
CACTTTATGATGGCGTAGGATGGACTGAAGAAGTTATGACCTTTTTACGCT 
ACAATGCTAATAAAGCTCTTATGAATTTAGGACAAGATCCTTTATTCCCAG 

ATACAGCAAATGATGTCAACCCAATTGTTATGAATGGTATTTCAACAGGAA
CATCAAACCATGACTTCTTCTCTCAAGTAGGTAATGGTTACCTACTTGGTA 
GCGTTGAAGCTATGCATGATGATGACTATAACTATGGATTATAA 

MTTYYEAINWNEIEDVIDKSTWEKLTEQFWLDTRIPLSNDLDDWRKLSAQEK
DLVGKVFGGLTLLDTMQSETGVEAIRADVRTPHEEAVLNNIQFMESVHAKSY
SSIFSTLNTKSEIEEIFEWTNNNEFLQEKARIINDIYANGNALQKKVASTYLETF
LFYSGFFTPLYYLGNNKLANVAEIIKLIIRDESVHGTYIGYKFQLGFNELPEDEQ
ENFRDWMYDLLYQLYENEEKYTKTLYDGVGWTEEVMTFLRYNANKALMNL
GQDPLFPDTANDVNPIVMNGISTGTSNHDFFSQVGNGYLLGSVEAMHDDDYN
YGL*



Sequence description: 

A] Length: 960 bp - 320 aa (full length gene)

B] Shine Dalgarno sequence present upstream of 
ATG start codon, but no signal peptide 
present. 
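
The "Length" entries in these sequence descriptions pair a nucleotide count with an amino-acid count. The following is a minimal sketch, assuming the aa figure is meant to equal the number of complete codons (bp = 3 x aa); the function name and the list of pairs are illustrative only, with the figures copied from descriptions in this listing.

```python
def length_consistent(bp: int, aa: int) -> bool:
    """Return True when the nucleotide count is exactly three times the aa count."""
    return bp == 3 * aa


# (bp, aa) pairs as reported in some of the sequence descriptions of this listing.
reported = [(867, 289), (960, 320), (311, 103), (312, 104)]

# Keep only the pairs whose two figures do not agree under the assumption above.
mismatches = [(bp, aa) for bp, aa in reported if not length_consistent(bp, aa)]
print(mismatches)
```

A pair that is flagged is not necessarily an error; a partial sequence may simply end in an incomplete codon.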




ID-98 



Clone RS-77 (partial sequence) 

(<E& 3:0 MO'. H5) 

ATGAATTGGTCACGTATCTGGGAACTCGTAAAAATTAATATCCTTTATTCA 
AACCCTCAGACTCTATCGGCACTAAGAAAAAAGCAAGAAAAGCATCCTAA
AAAAGAATTTTCAGCTTATAAATCCATGTTTAGAAATCAGTTATTTCAGAT 
TTTGCTCTTTTCAATAATTTATGTATTTCTCTTTGTATCACTTGATTTTAAAG 
AATATCCGGGCTATTTCACGTTCTACATTGGTATCTTTACACTAGTATCCAT 
TATCTACTCTTTTATTGCGATGTACAGTGTTTTCTATGAGAGTGACGATGTT 
AA

MNWSRIWELVKINILYSNPQTLSALRKKQEKHPKKEFSAYKSMFRNQLFQILL 
FSIIYVFLFVSLDFKEYPGYFTFYIGIFTLVSIIYSFIAMYSVFYESDDV 


Sequence description: 

A] Length: 311 bp - 103 aa (Partial sequence) 

B] Shine Dalgarno sequence present upstream of
ATG start codon, no obvious signal peptide at
N-terminus.

ID-99 


Clone RS-78 (partial sequence) 
(S&LTD f\)0'. M^B 

TAATCTTTTAGTCAACGGAGCAACAGGAAAATTGCAGGCTATGCGACAGA 
TATTCCACCACATAATTTAGCAGAAGTCATTGATGCTGTCGTGTACATGAT
TGATCACCCTAAAGCTAAATTAGATAAATTAATGGAATTTCTACCTGGTCC 
AGATTTTCCAACTGGCGCTATCATTCAAGGAAAAGATGAAATTCGTAAGG 
CATATGAGACTGGTAAGGGGAGAGTAGCGGTTCGCTCGCGAACTGCTATT 
GAAACCTTAAAAGGTGGTAAGAAACAAATTATTGTTACTGAAATTCCTTAT 
GAAGTTAAT

Cseil -J.D A/a; nr) 

sfsqrsnrkiagyatdipphnlaevidavvymidhpkakldklmeflpgpdfpt 
gaiiqgkdeirkayetgkgrvavrsrtaietlkggkkqiivteipyevn 


Sequence description: 

A] Length: 312 bp - 104 aa (Partial sequence) 

B] No obvious Shine Dalgarno sequence or a
signal peptide. Both N- and C- termini of ORF
yet to be elucidated.

ID-100



Clone RS-79 

10 (seo-^o no' hi) 

ATGGGACGTAAGTGGGCCAATATTGTTGCCAAAAAGACTGCTAAAGATGG 
TGCTAACTCAAAAGTATACGCTAAATTCGGTGTTGAAATATATGTTGCTGC 
AAAGCAAGGTGAACCAGACCCCGAGTCAAACTCAGCTCTAAAATTCGTTT 
TGGACCGTGCTAAGCAAGCACAAGTTCCAAAGCATGTTATTGATAAAGCG 

ATTGATAAAGCCAAAGGAAACACAGATGAAACTTTCGTAGAGGGACGCTA
TGAAGGTTTTGGTCCAAATGGTTCAATGATTATTGTGGATACTTTGACATC 
AAATGTTAACCGTACGGCAGCAAATGTACGTACTGCTTACGGTAAGAACG 
GTGGCAATATGGGAGCTTCAGGATCGGTATCCTACTTATTTGATAAAAAAG 
GTGTCATCGTTTTTGCTGGTGATGATGCTGACACTGTCTTCGAACAATTACT 

TGAAGCGGATGTAGACGTAGATGATGTTGAAGCAGAAGAGGGAACAATA
ACAGTTTATACCGCCCCAACAGATCTTCATAAAGGTATCCAAGCACTTCGC 
GATAATGGTGTAGAAGAATTCCAAGTTACTGAACTTGAAATGATTCCTCAA 
TCAGAAGTAGTATTGGAAGGTGATGACCTTGAAACTTTTGAAAAGCTT 

MGRKWANIVAKKTAKDGANSKVYAKFGVEIYVAAKQGEPDPESNSALKFVL
DRAKQAQVPKHVIDKAIDKAKGNTDETFVEGRYEGFGPNGSMIIVDTLTSNV
NRTAANVRTAYGKNGGNMGASGSVSYLFDKKGVIVFAGDDADTVFEQLLEA
DVDVDDVEAEEGTITVYTAPTDLHKGIQALRDNGVEEFQVTELEMIPQSEVVL
EGDDLETFEKL


Sequence description: 

A] Length: 654 bp - 218 aa (Partial sequence) 
B] Possible Shine Dalgarno sequence upstream
of ATG start, no obvious signal peptide



ID-101



Clone RS-80 



TTGGAGAAATATTTGAAGAACCCGATTACATGGATTGGATTAGTTCTTGTG 
GTTACGTGGTTTTTAACTAAAAGTAGTGAATTTTTGATTTTTGGTGTGTGTG 
TCTTGTTGTTAGTATTTGCTAGTCAAAGTGAT 

(SEQl -xd mk SO 

MEKYLKNPITWIGLVLVVTWFLTKSSEFLIFGVCVLLLVFASQSD 



Sequence description: 


A] Length: 135 bp - 45 aa (partial sequence) 

B] Shine Dalgarno sequence upstream of TTG 
start codon with possible signal peptide 
evident at N-terminus. 
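
Several descriptions in this listing name a GTG or TTG start codon while the corresponding protein begins with M. The following is a minimal sketch of such a translation, assuming the standard genetic code and assuming that an initial ATG, GTG or TTG is read as methionine; `translate_orf`, `CODON_TABLE` and `ALT_STARTS` are hypothetical names chosen for illustration and are not part of the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: these first codons are all read as methionine at the start of an ORF.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_orf(dna: str) -> str:
    """Translate an in-frame DNA string; any trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in ALT_STARTS:
            protein.append("M")  # initiator codon
        else:
            protein.append(CODON_TABLE[codon])  # "*" marks a stop codon
    return "".join(protein)


# First 30 nucleotides of the Clone RS-80 sequence (TTG start codon).
print(translate_orf("TTGGAGAAATATTTGAAGAACCCGATTACA"))
```

Note that only the first codon is treated specially: the internal TTG in the example is still translated as leucine.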



ID-102

Clone RS-81 

($E$L 1-0 f00\ 5& 

ATGACACAATCAGATGCATATCTCTCGTTGAACGCGAAGACACGCTTTAGA 
GATCGCACAGGTAATTATCATTTTACTTCGGATAAAGAGGCTGTTGAACAA
TATATGATAGAACATGTTGAACCTAATACGATGGTGTTCACATCACTAATT 
GAAAAGCTAGATTATTTGGTTTCTAATAACTACTATGAATCGGACCTTCTA 
AAACAATATAACCTTGAGTTTATTTGCCAAATTTTTGAGCATGCATACGCT 
AAGAAATTTGCTTTTCTAAATTTTATGGGGGCTTTAAAATTTTATAATGCTT 
ATGCTCTTAAT

(Sea. jo no: 5m3 

MTQSDAYLSLNAKTRFRDRTGNYHFTSDKEAVEQYMIEHVEPNTMVFTSLIE 
KLDYLVSNNYYESDLLKQYNLEFICQIFEHAYAKKFAFLNFMGALKFYNAYA 
LN



Sequence description: 






A] Length: 318 bp - 106 aa (Partial sequence) 

B] Shine Dalgarno sequence present upstream of 
ATG start codon, no obvious signal peptide 



ID-103



Clone 2-11 A 

(SEQ ID NO: 56)

ATGGTATTTATGGCAAATAAGAAAAAAACAAAAGGAAAGAAAACCAGAA 
GACCTACTAAGGCAGAAATAGAGCGTCAAAGAGCTATTCAAAGGATGATT 
ACTGCTCTTGTTTTAACAATTATTCTCTTCTTTGGTATTATCAGATTAGGTA 

TTTTTGGTATTACAGTCTATAACGTCATCCGTTTTATGGTAGGTAGCTTGGC
TTACTTATTTATTGCGGCAACTTTAATCTACCTTTATTTCTTTAAATGGTTG 
CGAAAGAAAGATAGCTTAGTAGCAGGTTTTTTGATAGCTTCTTTAGGATTA 
TTGATTGAGTGGCATGCTTACCTTTTCTCAATGCCTATTTTGAAAGATAAA 
GAAATTTTGCGTTCAACTGCTCGATTAATTGTGTCTGATTTAATGCAATTTA 

AAATCACTGTTTTTGCCGGTGGAGGTATGTTGGGTGCTTTGATTTACAAGC
CAATTGCTTTTCTCTTTTCTAATATTGGTGCCTATATGATTGGTGTTCTCTTC 
ATCATTTTGGGTCTCTTTTTAATGAGTTCTCTGGAAGTTTATGACATCGTCG 
AATTTATTAGAGCTTTTAAAAATAAAGTGGCAGAGAAGCACGAGCAAAAT 
AAAAAGGAGCGTTTTGCTAAGCGAGAGATGAAAAAAGCAATCGCTGAACA 

AGAGCGCATAGAGCGTCAAAAAGCTGAAGAAGAAGCTTATTTAGCTTCGG
TTAATGTAGACCCTGAAACGGGTGAGATTCTAGAGGATCAAGCTGAGGAC 
AATTTGGATGATGCGCTACCACCTGAGGTAAGTGAAACATCAACTCCGGT 
ATTTGAGCCAGAGATCCTTGCTTATGAGACATCGCCTCAAAATGATCCTTT 
ACCAGTAGAGCCGACAATTTATTTAGAAGACTATGATTCGCCGATTCCTAA 

TATGAGAGAAAATGATGAGGAAATGGTTTATGATTTAGATGATGATGTAG
ATGATAGTGATATAGAAAATGTCGACTTTACACCTAAAACGACACTGGTTT 
ATAAATTACCAACGATAGATTTATTTGCACCAGATAAGCCTAAAAATCAAT 
CCAAAGAAAAGGATTTAGTCCGAAAGAATATCAGAGTTTTAGAAGAAACA 
TTTAGAAGTTTTGGTATCGATGTAAAAGTAGAACGTGCTGAAATTGGACCA 

TCAGTTACTAAATATGAAATTAAACCAGCAGTTGGAGTTCGTGTGAATCGT
ATTTCAAATCTATCTGACGACCTAGCTCTTGCTCTTGCAGCAAAAGATGTG 
CGTATAGAAGCACCAATTCCTGGAAAATCATTAATAGGTATTGAAGTTCCT 
AACTCAGAAATTGCAACGGTTTCTTTCCGCGAACTTTGGGAACAATCTGAT 
GCCAATCCTGAAAACCTTTTAGAAGTACCACTAGGAAAAGCTGTTAACGG 

CAATGCTCGCAGTTTTAACTTAGCTAGAATGCCGCATCTTTTGGTAGCTGG
TTCAACTGGTTCAGGTAAATCTGTGGCAGTTAATGGAATTATTTCAAGTAT 
TTTGATGAAGGCACGTCCAGATCAAGTTAAGTTTATGATGATTGATCCCAA 
AATGGTTGAATTATCTGTTTATAATGATATTCCACATTTATTAATCCCTGTT 
GTAACCAATCCGCGTAAAGCAAGTAAGGCACTCCAAAAAGTTGTTGATGA 

AATGGAAAATCGATACGAGTTATTTAGCAAAATTGGTGTGCGTAATATAG
CAGGTTATAATACAAAGGTTGAAGAGTTTAATGCTTCCTCTGAGCAAAAAC 
AAATGCCTTTGCCTTTAATCGTTGTCATTGTAGATGAATTGGCTGACTTGAT 
GATGGTTGCTAGTAAAGAAGTTGAAGATGCTATTATTCGTTTGGGGCAAAA 
AGCACGTGCTGCAGGTATCCATATGATTCTTGCAACTCAACGTCCATCCGT 



AGATGTTATTTCTGGTTTGATTAAAGCAAATGTTCCGTCGCGTATTGCATTT 
GCTGTTTCAAGTGGTACTGATAGCCGTACGATCCTTGATGAAAATGGTGCT 
GAAAAGCTCTTGGGACGGGGTGACATGCTCTTTAAGCCTATTGATGAGAAT 
CATCCAGTACGACTACAAGGTTCCTTTATTTCAGATGATGATGTTGAAAGG 
ATCGTTGGTTTTATCAAAGACCAAGCCGAGGCTGACTATGATGATGCCTTT
GATCCTGGAGAAGTATCTGAAACAGATAACGGCTCTGGTGGTGGCGGCGG 
AGTACCTGAAAGTGATCCTCTTTTTGAAGAAGCCAAGGGACTCGTTTTAGA 
GACGCAAAAAGCAAGTGCCTCAATGATTCAACGCCGATTGTCTGTTGGTTT 
CAATAGAGCAACAAGACTAATGGAAGAATTAGAAGCAGCGGGGGTTATTG 
GTCCAGCAGAAGGAACCAAGCCACGAAAAGTTTTAATGACTCCAACTCCG
AGTGAATAA 

MVFMANKKKTKGKKTRRPTKAEIERQRAIQRMITALVLTIILFFGIIRLGIFGIT
VYNVIRFMVGSLAYLFIAATLIYLYFFKWLRKKDSLVAGFLIASLGLLIEWHA
YLFSMPILKDKEILRSTARLIVSDLMQFKITVFAGGGMLGALIYKPIAFLFSNIG
AYMIGVLFIILGLFLMSSLEVYDIVEFIRAFKNKVAEKHEQNKKERFAKREMK
KAIAEQERIERQKAEEEAYLASVNVDPETGEILEDQAEDNLDDALPPEVSETST
PVFEPEILAYETSPQNDPLPVEPTIYLEDYDSPIPNMRENDEEMVYDLDDDVDD
SDIENVDFTPKTTLVYKLPTIDLFAPDKPKNQSKEKDLVRKNIRVLEETFRSFGI
DVKVERAEIGPSVTKYEIKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGK
SLIGIEVPNSEIATVSFRELWEQSDANPENLLEVPLGKAVNGNARSFNLARMPH
LLVAGSTGSGKSVAVNGIISSILMKARPDQVKFMMIDPKMVELSVYNDIPHLLI
PVVTNPRKASKALQKVVDEMENRYELFSKIGVRNIAGYNTKVEEFNASSEQK
QMPLPLIVVIVDELADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVD
VISGLIKANVPSRIAFAVSSGTDSRTILDENGAEKLLGRGDMLFKPIDENHPVRL
QGSFISDDDVERIVGFIKDQAEADYDDAFDPGEVSETDNGSGGGGGVPESDPL
FEEAKGLVLETQKASASMIQRRLSVGFNRATRLMEELEAAGVIGPAEGTKPRK
VLMTPTPSE*


Sequence description: 

A] Length: 2451 bp - 817 aa (Full-length gene) 

B] Shine Dalgarno sequence present upstream of
ATG start codon, possesses a potential signal
peptide

ID-104

Clone 2-18/22b 



(sea -xo mo '.5^ 

ATGTCACAAGAGCAAGGAAAAATTTATATTGTAGAAGATGATATGACGAT 
TGTGTCACTTTTAAAAGATCATTTATCAGCTAGCTATCATGTCTCTAGTGTC 
AGCAATTTTCGTGATGTGAAACAAGAAATTATCGCATTTCAACCCGATTTG 
ATACTAATGGATATTACGTTACCCTATTTTAATGGTTTTTACTGGACTGCAG 
AATTGCGTAAGTTTTTAACAATTCCTATTATTTTCATTTCATCTAGTAATGA
TGAAATGGATATGGTTATGGCATTAAATATGGGGGGTGATGACTTTATTTC 
AAAACCATTCTCTCTAGCTGTATTAGATGCTAAGCTAACTGCTATTTTAAG 
GAGAAGTCAACAATTTATCCAACAGGAATTAACTTTTGGGGGATTTACGTT 
GACAAGAGAAGGGTTATTGTCTAGCCAAGATAAAGAGGTTATTTTATCGC 
CAACAGAAAATAAAATCCTATCTATCTTGCTCATGCATCCTAAACAAGTAG
TCTCAAAAGAGTCTCTATTAGAGAAACTTTGGGAAAATGATAGTTTTATTG 
ATCAAAATACACTTAATGTTAATATGACACGCTTACGTAAAAAAATTGTCC 
CAATAGGTTTTGATTACATTCATACAGTGAGAGGAGTTGGGTATTTACTAC 
AATGA 


MSQEQGKIYIVEDDMTIVSLLKDHLSASYHVSSVSNFRDVKQEIIAFQPDLILM 
DITLPYFNGFYWTAELRKFLTIPIIFISSSNDEMDMVMALNMGGDDFISKPFSLA 
VLDAKLTAILRRSQQFIQQELTFGGFTLTREGLLSSQDKEVILSPTENKILSILLM
HPKQVVSKESLLEKLWENDSFIDQNTLNVNMTRLRKKIVPIGFDYIHTVRGVG
YLLQ* 



Sequence description: 


A] Length: 669 bp - 223 aa (full-length gene 
sequence) 

B] Shine Dalgarno sequence present upstream of a GTG start codon. 
Was not identified directly by LEEP. This gene was found upstream of
gene ID-10 described in WO 00/06736.



ID-105
Clone 2-20

(SEEL J® (\)6> '. 51 ) 

ATGTATCAAACTCAGACAAATAAGGAAAAATTTGTTTTATTTTTGAAATTA 
TTTATCCCAGTATTGATTTATCAATTTGCTAATTTTTGAGCTACTTTTATTGA 
TTCGGTTATGACTGGACAGTATAGTCAGCTACATTTGGCAGGTGTGTCAAC
TGCTAGTAATTTATGGACTCCGTTTTTCGCTTTATTAGTAGGTATGATTTCA 
GCATTAGTACCAGTAGTTGGTCAACATTTGGGTAGAGGAAATAAAGAACA 
AATTCGCACAGAATTTCATCAATTTCTATATTTAGGTTTGATACTGTCCTTA 
ATATTATTTTTAATCATGCAATTTATTGCTCAACCTGTCTTGGGGAGTTTGG 



GTTTAGAAGATGAAGTTCTAGCAGTTGGTCGTGGTTATTTAAATTATATGT 
TGATTGGAATCATGCCGCTGGTGTTGTTTAGCATTTGCCGTTCATTCTTTGA 
TGCATTGGGGTTAACAAGGTTATCTATGTATCTGATGCTTTTAATTCTACCC 
TTTAATTCATTTTTTAATTATATGCTTATCTACGGTAAATTTGGTATGCCTA 
GACTAGGAGGTGCGGGGGCAGGTCTTGGAACTTCTTTAACTTATTGGGCTA
TTTTTATTGGTATTATTATTGTGATGTCACTTCATCCTCAAATTAAAACATA 
TCATATATGGACTCTGGAAAGAATAAAAGCTCCTTTGATTATTGAAGATAT 
TCGATTGGGATTACCGATTGGTTTACAAATTTTTGCAGAAGTTGCAATTTTT 
GCAGTAGTAGGCTTATTCATGGCAAAATTTTCTTCAATCATTATTGCAGCA 

CATCAGGCTGCTATGAATTTTTCATCATTAATGTATGCATTTCCTTTAAGTA
TTTCCACTGCTCTAGCTATTACAATATCGTTTGAAGTAGGGGCAGAGCGCT 
TTCAGGACGCAACCACTTATAGTAGGATAGGACGCTTAACAGCGGTAGGG 
ATTACATCAGGAACCTTACTATTTTTATTTCTATTTCGTGAGAATGTAGCAG 
CAATGTATAATAGTGCCCCTCACTTTGTCGCTATTACAGCTCAATTCCTAAC 

TTATAGTCTCTTTTTCCAGTTTGCAGATGCTTATGCAGCTCCTGTACAGGGG
ATTTTACGAGGCTATAAGGATACAACAAAACCATTTATGATCGGTGCGGG 
CTCTTATTGGTTATGTGCTTTGCCATTAGCGGTTATCTTAGAAAAAAATAG 
CCAGTTAGGTCCGTTTGCCTATTGGATTGGTTTAATCACAGGTATTTTTGTT 
TGTGGTCTATTTCTAAACCAACGTCTGCAAAAGATTAAGAAGTTGTATTAT 

TAA

(SeSi 30 bO) 

MYQTQTNKEKFVLFLKLFIPVLIYQFANFSATFIDSVMTGQYSQLHLAGVSTAS
NLWTPFFALLVGMISALVPVVGQHLGRGNKEQIRTEFHQFLYLGLILSLILFLI
MQFIAQPVLGSLGLEDEVLAVGRGYLNYMLIGIMPLVLFSICRSFFDALGLTRL
SMYLMLLILPFNSFFNYMLIYGKFGMPRLGGAGAGLGTSLTYWAIFIGIIIVMS
LHPQIKTYHIWTLERIKAPLIIEDIRLGLPIGLQIFAEVAIFAVVGLFMAKFSSIIIA
AHQAAMNFSSLMYAFPLSISTALAITISFEVGAERFQDATTYSRIGRLTAVGITS
GTLLFLFLFRENVAAMYNSAPHFVAITAQFLTYSLFFQFADAYAAPVQGILRG
YKDTTKPFMIGAGSYWLCALPLAVILEKNSQLGPFAYWIGLITGIFVCGLFLNQ
RLQKIKKLYY*



Sequence description: 


A] Length: 1341 bp - 447 aa (full length gene) 

B] Shine-Dalgarno sequence present upstream of
ATG start codon. There is a potential signal
peptide sequence.




ID-106



Clone 2-4A 



ISM- 3D <VQ ■ JhU 

TTGCTAGTTTCTTCTCTAGTTTCTTGTTCATTTTTTCTTGTCATTTCGTCGTT
GTCTTCATCAACACGAAATAAGTCTATAAACTTATCAAATAATTTCATAGA 
CTTATTATATCAATTTTCAATAAAATGCTATAATAAAACCATGTCATTTTCA 
TTAAAAATTAGAAATCCATACGGTGAACATACCGTTAAAGAACTCCTTGA 
AGATTATTTTTTGATTCCACGTAAGATTAGACATTTTTTGCGTGTTAAAAAA 

CATGTACTTATAAACAATGAATTCATTAATTGGCAAACTGTCGTCCAAGAA
AACGATACTATTACCTTAATCTTTGATGATGAGGATTACCCTACTAAAAAA 
ATTCCTCTGGGCAGAGCAGAGCTTATTGATTGTCTTTATGAGGATGAACAT 
CTTATTATCGTTAATAAACCTGAAGGTATGAAAACTCACGGTAACCAACCA 
AATGAAATAGCACTGTTAAATCATGTATCTGCCTATTCTGGACAAACATGC 

TATGTTGTTCATCGCCTAGATATGGAGACCAGTGGAGCTGTTTTATTTGCT
AAAAATCCATTTATACTTCCCCTTATCAATCAACGCTTAGAACGAAAAGAA 
ATTTGGCGTGAATATTGGGCTTTAGTTGAAGGAAAATTTTCACCTAAGCAT 
CAAGTTTTGAGAGACAAAATTGGACGGAACCGTCATGACAGACGTAAACG 
AATCATTGATTCTAAAAACGGTCAACATGCTATGACAATCATTGACGTTTT 

GAAGTATATCCAAAATAGTAGTCTCATAAAATGCCGACTGGAAACCGGAA
GAACCCATCAAATTCGCATTCACTTATCTCATCACGGACATCCTTTAATAG 
GAGATCCCCTCTACAACCCTTCTTCTAATAATGAAAGGTTAATGCTACACG 
CTCACCGATTGACTCTATCCCATCCATTAACTTGCGAAACTATTAGCGTAG 
AGGCCCCTTCATCTACTTTCGAGAAGGTTTTAAACAATTATAAAAAAGGAG 

TTGGATAA

ASgfl TO 1*0'. (£) 

MLVSSLVSCSFFLVISSLSSSTRNKSINLSNNFIDLLYQFSIKCYNKTMSFSLKIR
NPYGEHTVKELLEDYFLIPRKIRHFLRVKKHVLINNEFINWQTVVQENDTITLIF
DDEDYPTKKIPLGRAELIDCLYEDEHLIIVNKPEGMKTHGNQPNEIALLNHVSA
YSGQTCYVVHRLDMETSGAVLFAKNPFILPLINQRLERKEIWREYWALVEGKF
SPKHQVLRDKIGRNRHDRRKRIIDSKNGQHAMTIIDVLKYIQNSSLIKCRLETG
RTHQIRIHLSHHGHPLIGDPLYNPSSNNERLMLHAHRLTLSHPLTCETISVEAPS
STFEKVLNNYKKGVG*


Sequence description: 

A] Length: 1029 bp - 343 aa (Full length gene sequence) 

B] No obvious Shine-Dalgarno sequence upstream
of the putative TTG start codon. Possesses a
potential leader peptide sequence.



ID-107



Clone 2-54 


(seqli&wd: (,3) 

GAACTAAATGCAACTCAACCTAATAATAGAACTACCTATATTATACCCGAA 

AGCAGTCATTCCATTGCAGAACAACAGAGATTCCTGATAGAATCAAAGGG 

TTCTTCGGTTGCATTACTTAATAGCGATGAATTTAGAAAGACAGCGGGAGA 

GGATAGAGGTTTTGAAAGGGATAAGTTGAGGTCTTTGGATATCATTCCTAA
GGGAGATTTATCGACAAGTAATGTCATAGGTAATACGGACATTGCTAGTC 
AGATATCGTTGGGCTTTAAAAAGAATGCGATGCAGGAACACCATCTTACT 
AAAACATTCTCTCAAAAGGATGGAAAGTTATCGTCTGTTATAGAGGGGAT 
GCTTGCTATTGGCAAAGAGAAAGTAGAGAAAGAAATAAAATATAGTGGTA 

ATTTATGGCAAAAATTAAAAGCTAAGGCACACTGCCTTGTTTGCTGTGTTG
ATAATTTGAATTTTGAAGATATAAAATCTTATTTTCAATATTATTGTCATCT 
AAACCATCAGCTCAAATTACCTAAAGGTGCTATACTTTCTGCTAAAACAGA 
AGTATATAGGGGAGGAGATTTTGGGAGAAAAAATAAAGATAATGTGTTTG 
GTTACCGTATCCCCTCATTATTGAAAACCCAAAAAGGAACTTTACTTGCGG 

GAGCTGATGAAAGAATTGAGCAAGCTTGTGATTGGGGAAACATAGGAATG
GTTATTCGCCGTAGTGAGGATGATGGTGTCACTTGGGGAAAAAGAGAAAC 
TATTGTCAATCTCCGTAATAACCCTAGAGTTCCGCTAGTTACTAGTGGTGA 
CTATAGTGGCTCACCTATTAATATGGATATGGCATTAGTTCAAGATACTAG 
CTCCAAGACGAAACGTATTTTTTCAATATATGATATGTTTCCAGAAGGAAG 

AGGCGTTATTAGTATTGCTAACACACCTGAAAAAGAATATACCCAAATCG
GAGGACAGTCTTATCTTAATTTATATAATAATGGAAAGAAATCGAAGGTTT 
TTACTATCCGTGACAAAGGTATTGTATATAATTTTAAAGGGAAAAAGAGTG 
ATTATCATGTTATAACAGAAACTACTAAAAGTGACCATTCAAATCTAGGGG 
ATATTTATAAGGGAAAACAGCTACTTGGAAATATATATTTTACAAAACATA 

AAACGTCACCATTTCGTTTAGCAAAATCAAGCTATGTGTGGATGTCATATA
GCGATGATGATGGTAGGACATGGTCATCACCTAGAGATATAACAGCAAGT 
CTTCGTCAGAAAGGCATGAAATTTTTGGGAATAGGACCTGGAAAAGGTAT 
AGTTTTAAAATGGGGGCCACACGCTGGTCGTATTATTATTCCTGCCTATTCT 
ACGAATTGGAAATCTCATCTAAGAGGTTCACAATCTTCACGCCTAATTTAT 

TCAGACGACCATGGAAAAACGTGGCATACTGGAAAAGCAGTTAATGATAA
CCGTATACTTTCTAATGGTGAAAAAATTCACTCCTTAACAATGGATAATAA 
AAAAGAACAAAATACAGAATCCGTACCCGTTCAATTGAAAAATGGGGACA 
TTAAGTTATTTATGAGGAATCTAACTGGTAACCTAGAAGTAGCCACAAGTA 
AAGACGGCGGGGAGACTTGGCAAAACCATGTTAAACGATATAAGGAAATT 

CATGATGCTTACGTCCAACTATCAGCTATTCGCTTTGAGCATGACAAAAAA
GAGTATATTTTATTAGTGAATGCTAATGGGCCAGGGAAGAAGTGCCAAGA 
TGGATATGCACGTCTAGCGCAAGTTAATCGAAATGGTAGTTTTAAGTGGTT 
ATATCACCATCACATTCAAGATGGTTCGTTTGCTTACAACTCTGTTCAACA 
ACTTAATAATGATCAATTTGGTGTCCTTTATGAACATAGAGAAAAACATCA 



AAATAGTTTTACTTTAAATTACAAAGTTTTTAATTGGAGTTTTCTTAGTCAA 
AATACAGAGAAGCAAGGCACTTTATGGGAGAAAATGGCAGCAAATTGGCA 
TGTTTTGTTTAAATTTTATTTATGA 

5 tsex :ed no'. <o*4) 

ELNATQPNNRTTYIIPESSHSIAEQQRFLIESKGSSVALLNSDEFRKTAGEDRGF
ERDKLRSLDIIPKGDLSTSNVIGNTDIASQISLGFKKNAMQEHHLTKTFSQKDG
KLSSVIEGMLAIGKEKVEKEIKYSGNLWQKLKAKAHCLVCCVDNLNFEDIKS
YFQYYCHLNHQLKLPKGAILSAKTEVYRGGDFGRKNKDNVFGYRIPSLLKTQ
KGTLLAGADERIEQACDWGNIGMVIRRSEDDGVTWGKRETIVNLRNNPRVPL
VTSGDYSGSPINMDMALVQDTSSKTKRIFSIYDMFPEGRGVISIANTPEKEYTQI
GGQSYLNLYNNGKKSKVFTIRDKGIVYNFKGKKTDYHVITETTKSDHSNLGDI
YKGKQLLGNIYFTKHKTSPFRLAKSSYVWMSYSDDDGRTWSSPRDITASLRQ
KGMKFLGIGPGKGIVLKWGPHAGRIIIPAYSTNWKSHLRGSQSSRLIYSDDHG
KTWHTGKAVNDNRILSNGEKIHSLTMDNKKEQNTESVPVQLKNGDIKLFMRN
LTGNLEVATSKDGGETWQNHVKRYKEIHDAYVQLSAIRFEHDKKEYILLVNA
NGPGKKCQDGYARLAQVNRNGSFKWLYHHHIQDGSFAYNSVQQLNNDQFG
VLYEHREKHQNSFTLNYKVFNWSFLSQNTEKQGTLWEKMAANWHVLFKFYL*


Sequence description: 

A] Length: 2052 bp - 684 aa (partial gene sequence) 


B] N-terminus has yet to be determined 


ID-108 



Clone 2-61

(SBL TO NO'- lfi5) 

ATGCCTAAATTAATCGTATCTTTCCTCTGCATTTTATTATCCCTGACTTGTG 
TAAACTCTGTGCAAGCTGAAGAACATAAAGATATTATGCAAATTACCCGA 

GAAGCCGGATATGATGTTAAAGATATTAATAAACCTAAAGCGTCTATCGTT
ATTGACAATAAAGGTCATATTTTGTGGGAAGATAACGCCGATTTAGAACGT 
GATCCCGCTAGCATGTCTAAAATGTTTACTTTATATTTACTATTTGAAGACT 
TAGCTAAAGGAAAAACAAACCTCAACACCACAGTGACTGCAACAGAAACA 
GAGCAAGCGATAAGTAAGATTTATGAAATTAGTAATAAGAATATTGATGCT 

GGGGTTGCTTATCCTATTCGTGAACTGATTACTATGACGGCTGTCCCGTCA
TCTAATGTAGCAACTATTATGATTGCTAACCACTTATCACAAAACAATCCT 
GACGCCTTTATTAAACGAATCAATGAAACCGCCAAGAAACTCGGTATGAC 
AAAAACTCACTTTTATAACCCCAGTGGGGCGGTAGCGAGTGCTTTTAATGG 
ACTTTACTCCCCAAAAGAATACGATAACAATGCTACTAACGTTACGACTGC 



ACGTGATCTATCAATTTTAACCTATCATTTCCTTAAAAAATACCCTGATATA 
CTGAACTATACAAAATATCCTGAAGTCAAGGCCATGGTCGGAACTCCTTAT 
GAAGAAACATTTACAACTTATAACTACTCTACCCCCGGCGCTAAATTTGGA 
TTAGAAGGAGTAGATGGCTTAAAAACTGGTTCTAGCCCTAGCGCTGCTTTT 
AATGCCTTAGTTACAGCTAAACGCCAGAATACTCGCTTGATAACTGTGGTT
TTAGGAGTTGGCGATTGGTCAGACCAAGACGGAGAGTACTATCGTCATCC 
GTTTGTCAACGCTCTTGTAGAAAAAGGTTTTAAAGACGCTAAAAATATTTC 
TTCTAAAACTCCTGTATTAAAAGCCGTTAAACCTAAAAAAGAAGTTACTAA 
AACCAAAACTAAATCTATTCAAGAACAGCCTCAAACAAAAGAACAGTGGT 
GGACAAAAACAGATCAATTTATCCAATCACATTTTGTATCTATTTTAATTG
TTCTGGGCACCATCGCTAGCCTTTGTCTTTTAGCTGGGATAGTATTACTTAT 
AAAGCGCTCTAGATAA 

MPKLIVSFLCILLSLTCVNSVQAEEHKDIMQITREAGYDVKDINKPKASIVIDN
KGHILWEDNADLERDPASMSKMFTLYLLFEDLAKGKTNLNTTVTATETDQAI
SKIYEISNNNIHAGVAYPIRELITMTAVPSSNVATIMIANHLSQNNPDAFIKRINE
TAKKLGMTKTHFYNPSGAVASAFNGLYSPKEYDNNATNVTTARDLSILTYHF
LKKYPDILNYTKYPEVKAMVGTPYEETFTTYNYSTPGAKFGLEGVDGLKTGS
SPSAAFNALVTAKRQNTRLITVVLGVGDWSDQDGEYYRHPFVNALVEKGFK
DAKNISSKTPVLKAVKPKKEVTKTKTKSIQEQPQTKEQWWTKTDQFIQSHFVS
ILIVLGTIASLCLLAGIVLLIKRSR*



Sequence description:

A] Length: 1188 bp - 396 aa (full length gene)

B] Shine Dalgarno sequence present upstream of
ATG start codon, possesses a potential signal
peptide



ID-109

Clone 45 

■ (se& zo W. &?) " 

ATGACTGAAAAATATTATAATTGGGCAACGCTTGGAACCGGCGTTATTGCC
AACGAATTAGCCCAAGCACTGGAAGCACGTGGACAAAAATTATATTCTGT 
AGCTAATAGAACTTACGACAAAGGACTTGAATTTGCTAACAAATATGGTA 
TCCAAAAAGTTTATGATCACATAGATCAAGTATTTGAAGACCCTGAAGTGG 
ATATCATTTATATCTCTACTCCCCACAATACTCACATCTCATTTTTACGAAA 



GGCTTTAGCAAATGGTAAGCACGTTCTTTGCGAAAAATCTATTACTTTAAA 
TAGTACTGAGCTTAAAGAAGCCATAGATTTAGCCGAAACTAACCATGTTGT 
CTTAGCTGAAGCCATGACTATTTTTCATATGCCAATTTACCGCCAATTAAA 
AACATTAGTTGATAGTGGAAAATTAGGACCGTTAAAAATGATTCAAATGA 
ATTTCGGAAGTTATAAAGAATATGATATGACTAACCGTTTTTTCAGTCGTG
ACCTAGCAGGCGGTGCTTTGCTGGACATTGGTGTTTATGCACTTTCTTGTAT 
TCGCTGGTTTATGTCAGAAGCACCTCACAACATTACCTCTCAAGTTACATT 
TGCACCAACAGGGGTTGATGAACAAGTTGGTATCCTACTAACCAACCCAG 
CAAATGAGATGGCGACTGTCAGCCTTAGTTTACATGCAAAACAACCTAAA 

CGAGCAACTATCGCTTACGATAAAGGCTACATTGAACTTTTTGAATATCCG
CGAGGACAAAAGGCAGTTATTACTTATACTGAGGATGGGCATCAAGATAT 
TATCGAAGCTGGCAAAACTGAAAATGCTCTCCAATATGAGGTAGCTGATA 
TGGAAGAAGCCATTTCAGGAAAAACTAACCACATGTACTTAAACTATACC 
AAAGATGTTATGGATATCATGACACAGCTACGTCAAGAATGGGGATTTAC 

CTACCCAGAAGAAGAAAAATGA

fSEGL 30 W\ taf) 

MTEKYYNWATLGTGVIANELAQALEARGQKLYSVANRTYDKGLEFANKYGI 
QKVYDHIDQVFEDPEVDIIYISTPHNTHISFLRKALANGKHVLCEKSITLNSTEL 
KEAIDLAETNHVVLAEAMTIFHMPIYRQLKTLVDSGKLGPLKMIQMNFGSYK 
EYDMTNRFFSRDLAGGALLDIGVYALSCIRWFMSEAPHNITSQVTFAPTGVDE
QVGILLTNPANEMATVSLSLHAKQPKRATIAYDKGYIELFEYPRGQKAVITYT
EDGHQDIIEAGKTENALQYEVADMEEAISGKTNHMYLNYTKDVMDIMTQLR
QEWGFTYPEEEK* 


Sequence description: 

A] Length: 984 bp - 328 aa (full length gene) 

B] Shine Dalgarno sequence present upstream of
ATG start codon, possesses a potential signal
peptide



ID-110



Clone 2-2 

GTGTATTCTCCTGTTAAATCTTCTAAAGGAAAAGTGATATTGTTAAAAAGT 
GATTTTCTAAAGAGCTTCATAGAAAGGAGAGGAAATATTTGTTTT 

( 5EQ. to \ TV) 

MYSPVKSSKGKVILLKSDFLKSFIERRGNICF 



Sequence description: 

A] Length: 96 bp - 32 aa (partial sequence)

B] GTG start codon - no obvious Shine-Dalgarno
sequence. Possesses a potential signal peptide.

ID-111



Clone 2-3

AAATACTGTATCATTGCAACCTCAAATGCAGGTTTTGGAAACGAAGCATTT 
ACAGGTGACAGCGATAAAGACTTGAAAATTATGGAACGAATTTCTCCATA 

TTTCCGTCCAGAATTTCTAAATCGTTTCAATGGTGTTATTGAATTCTCTCAC
CTAAGCAAAGATGACTTAAGCGAAATTGTAGATTTGATGCTTGATGAAGTT 
AACCAAACAATTGGCAAAAAAGGAATTGACCTTGTGGTAGATGAAAATGT 
TAAATCACACTTAATTGAACTGGGTTATGACGAAGCAATGGGAGTACGTC 
CATTGCGCCGTGTCATCGAGCAAGAAATTCGAGATCGCATCACAGACTACT 

ATCTCGATCATACAGACGTTAAACACCTAAAAGCTAATTTGCAAGATGGCC
AAATCGTCATTTCTGAAAGATAA 

c sea no'. 3£>) 

KYCIIATSNAGFGNEAFTGDSDKDLKIMERISPYFRPEFLNRFNGVIEFSHLSKD 
DLSEIVDLMLDEVNQTIGKKGIDLVVDENVKSHLIELGYDEAMGVRPLRRVIE 
QEIRDRITDYYLDHTDVKHLKANLQDGQIVISER*



Sequence description: 

A] Length: 429 bp - 143 aa (partial sequence)

B] N-terminus yet to be elucidated. This gene 
was not in frame with nuc 




ID-112 



Clone 2-5 



(SE& ZED N0' ^3) 

ATGTCAATGAATTTTTCATTTTTACCACAATATTGGTCCTATTTTAATTATG 
GTGTGATGGTAACCATTATGATTTCAACATGTGTTGTTTTTTTTGGAACTAT 
TATAGGCGTGTTAATTGCTTTAGTAAAGCGTACTAATTTACATTTTCTCACA 
ATATTAGCTAATTTCTATGTATGGGTATTTCGTGGGACACCGATGGTAGTT 
CAAATTATGATTGCTTTCGCATGGATGCATTTTAACAATTTACCAACAATT
AGCTTTGGTGTTTTAGATTTAGATTTTACACGACTTTTACCTGGTATCATTA 
TCATTTCCTTAAATAGTGGTGCCTATATTTCGGAAATTGTACGTGCAGGGA 
TTGAGGCTGTACCATCTGGACAAATAGAAGCAGCTTACTCGTTGGGGATTC 
GACCTAAAAATACACTTCGCTATGTTATCTTACCCCAAGCTTTTAAAAATA 
TTTTACCTGCTCTAGGGAATGAATTTATTACAATTATTAAAGATAGTGCTCT
CCTTCAAACTATTGGTGTCATGGAATTATGGAACGGAGCACAATCAGTTGT 
AACGGCTACTTACTCACCAGTTGCACCGTTATTATTTGCAGCATTTTACTAT 
TTAATGTTGACAACGATTCTCTCAGCTTTGTTAAAACAAATGGAGAAATAT 
CTTGGGAAAGGGGTAAAAATAGATGGTTGA 

15 (SBGirW fty) 9W) 

MSMNFSFLPQYWSYFNYGVMVTIMISTCVVFFGTIIGVLIALVKRTNLHFLTIL
ANFYVWVFRGTPMVVQIMIAFAWMHFNNLPTISFGVLDLDFTRLLPGIIIISLNS
GAYISEIVRAGIEAVPSGQIEAAYSLGIRPKNTLRYVILPQAFKNILPALGNEFITI
IKDSALLQTIGVMELWNGAQSVVTATYSPVAPLLFAAFYYLMLTTILSALLKQ
MEKYLGKGVKIDG*



Sequence description: 


A] Length: 699 bp - 233 aa (full length gene) 

B] Shine-Dalgarno sequence preceded the 'ATG' 
start codon. Possesses a potential leader peptide 
sequence. 



ID-113
Clone 2-7

LSEfLZO NO'. IS) 

ATGAAAGACCTATTACGAAATAGTCTAGAGCAAAGTGGAAATTTAAGTTT 
TCAAGATATGATTTTACATATTCTTGTAGCAGCTTTATTGAGTGTAGTTATT 
TATGTTTCCTATGCTTATACGCATAGTGGAACTGCCTATAGTAAAAAGTTT
AATGTTTCATTAATGACATTGACGGTCTTGACTGCAACAGTAATGACCGTT 
ATTGGTAATAATGTAGCCTTGTCATTGGGTATGGTCGGTGCCTTGTCAGTT 
GTTCGTTTTAGGACAGCCATAAAAGATTCAAGAGATACAGTTTATATTTTT 
TGGACCATAGTTGTTGGTATCTGTTGTGGTGTCGGTGACTATGTGGTAGCT 



GCATTAGGAAGTAGCGTTATCTTTATCTTATTATGGGTTATGGGACGTGTT 

AAAAACGAGAATCGTATGTTATTGATTGTGAAGTGCGATAGAACACTAGA 

AGTTGATTTAGAAGGAATTTTCTTCCAATATTTTGACGGAAAAGCTGTTCA 

GCGTGTTAAAAATTCAACAACTAATACTATTGAAATGATTTTCGAAATCTC 

TAGAAAAGATTACGATAAGCAACTCCATGTAGATAATCAGTTAACTGAAA 

AAGTGTACCAATTGGGAAATATTGATTATTTCAACATTGTTAGCCAAAGCG 

ACGAAATCAATGGGTAG 

MKDLLRNSLEQSGNLSFQDMILHILVAALLSVVIYVSYAYTHSGTAYSKKFNV
SLMTLTVLTATVMTVIGNNVALSLGMVGALSVVRFRTAIKDSRDTVYIFWTIV
VGICCGVGDYVVAALGSSVIFILLWVMGRVKNENRMLLIVKCDRTLEVDLEGI 
FFQYFDGKAVQRVKNSTTNTIEMIFEISRKDYDKQLHVDNQLTEKVYQLGNID 
YFNIVSQSDEING* 



Sequence description:

A] Length: 678 bp - 226 aa (full-length gene)

B] ATG start codon is preceded by a Shine-
Dalgarno sequence. Possesses a potential leader
peptide sequence.



ID-114
Clone 2-8

(set 30 w. iti

AAAAATTCATTTTAGATTCATTTTACGACTATATACTCAGAAGTACCAAAC

CTAATCCAAGGTTTGAAAAAAGAAAGAAGGAAGTCAGTATGACAAACTAT

AAAAACAACTTTAAAGATGAGGCTATACGTGTTGAAGAGACAACAAAAGA

ATCATTTTACGATGTTGATATTGCCTTGTTTTCAGCTGGTGGATCTATTTCA

GCAAAGTTCGCTCCTTATGCAGTAAAGTCTGGAGCAGTTGTAGTAGATAAC

ACGTCATATTTTCGTCAGAATCCTGATGTTCCACTAGTTGTTCCTGAAGTAA

ATGCTCATGCCATGATTGGTCATAATGGTATCATAGCTTGTCCCAATTGTTC

TACTATTCAAATGATGATTGCTTTAGAGCCCATTCGTCAAAAATGGGGGAT

AGAGCGTGTTATAGTTTCCACCTATCAAGCTGTTTCGGGTTCAGGTGCACG

TGCTGTTGAAGAAAGTAAGGAAGAGTTGAGACAAGTTTT

(set a^o too '. tf)

KFILDSFYDYILRSTKPNPRFEKRKKEVSMTNYKNNFKDEAIRVEETTKESFYD
VDIALFSAGGSISAKFAPYAVKSGAVVVDNTSYFRQNPDVPLVVPEVNAHAMI
GHNGIIACPNCSTIQMMIALEPIRQKWGIERVIVSTYQAVSGSGARAVEETKEQ
LRQV



Sequence description:

A] Length: 499 bp - 165 aa (partial sequence)

B] N-terminus has yet to be determined



ID-115 


Clone 2-9 

(SEk3VH0\ 

ATGACAAATGAATTGATAATGCAAGCTTTTGAGTGGTATTTACCTAGTGAT
GGGAATCACTGGAAGAAATTAGAGGAGTCTATATCAGACCTTAAAAAACT 
TGGAATTAGTAAAATCTGGTTACCACCAGCATTTAAGGGAACTAGCAGTG 
ATGATGTAGGATATGGTGTTTATGATCTCTTTGATTTAGGAGAATTTGACC 
AGAATGGAACAATTAGAACAAAATATGGTAGGAAAGAAGAGTATCTAAA 

GCTTATTAAGTCGTTAAAGGCAAATGGCATTAAACCGTTTGCAGATATCGT
TCTTAACCATAAAGCCAATGGTGATCATAAAGAAAAATTTCAAGTCATCA 
AAGTCAATCCTGAAAATCGTCAAGAAGCATTAAGTGAACCCTATGAGATT 
GAAGGATGGACGGGATTTGATTTCCCAGGTAGACAGGGTGAGTACAATGA 
TTTT 

MTNELIMQAFEWYLPSDGNHWKKLEESISDLKKLGISKIWLPPAFKGTSSDDV 
GYGVYDLFDLGEFDQNGTIRTKYGRKEEYLKLIKSLKANGIKPFADIVLNHKA
NGDHKEKFQVIKVNPENRQEALSEPYEIEGWTGFDFPGRQGEYNDF


Sequence description: 

A] Length: 456 bp - 152 aa (partial sequence) 

B] ATG start codon is preceded by a Shine-
Dalgarno sequence, no leader peptide sequence.



ID-116



Clone 2-10 



CSEOl so tOb '• ^l) 

ATGGAGGTTCTTATGAAGAAAGTGTTAGTAAGTAGTCTTTTGGTTTTAGGG 
ATTACGATAACGTTACAACCAGTAGTTGAGGCTAAGGGGCCAAAAGTAGC 
TTATACACAAGAGGGAATGACTGCTCTTTCGGACACAAATAAAGATAAAG 
TCACTACTATTTCTATTGACGAGATTCAAAAAAGCTTAGAAGGTAAGAAGC 
CGATTACTGTTAGTTTTGATATTGATGATACACTGCTTTTCAGTAGTCAATA
TTTTCAATATGGTAAAGAATATGTAACTCCTGGATCGTTTGATTTTCTTCAT 
AAACAAAAATTCTGGGATCTTGTTGCAAAACGAGGAGATCAAGATTCCAT 
TCCCAAAGAATATGCTAAAAAATTAATTGCTATGCATCAAAAACGAGGAG 
ATAAAATTGTTTTTATAACAGGTAGGACAAGAGGGTCAATGTATAAGGAG 
GGCGAGGTTGATAAAACAGCTAAAGCCTTAGCTAAAGATTTTAAATTTGTA
CCATCTGAT 

MEVLMKKVLVSSLLVLGITITLQPVVEAKGPKVAYTQEGMTALSDTNKDKVT
TISIDEIQKSLEGKKPITVSFDIDDTLLFSSQYFQYGKEYVTPGSFDFLHKQKFW
DLVAKRGDQDSIPKEYAKKLIAMHQKRGDKIVFITGRTRGSMYKEGEVDKTA
KALAKDFKFVPSD 



Sequence description: 


A] Length: 516 bp - 172 aa (partial sequence) 

B] ATG start codon is preceded by a Shine-
Dalgarno sequence. Possesses a leader peptide
sequence.


ID-117 



Clone 2-17

ATGCTTAAAAGATTATTTACTGAAGATGGGGAATTGACAAAGATTAGTCGT 
CGTTTCGTTTGGATGTTAGTGGTTATCTATTGTCTTATTATTGTCAGGATGT 

GTTTTGGGCCTCAAATTATGATTGAGGGGGTATCAACTCCGAATGTTCAGC
GCTTCGGAAGAATTGTAGCTCTTTTAGTACCATTTAATTCTTTTCGTAGTTT 
AGATCAGCTAACTAGCTTTAAAGAGATTCTTTGGGTTATTGGTCAAAATGT 
AGTGAATATTTTACTGCTGTTTCCTCTCATTATAGGGTTACTATCCCTAAAG 
CCAAGTTTACGGAAATATAAAAGCGTTATATTACTTGGTTTGTTGATGTCTC 

TTTTCATAGAGTGTACTCAAGTTGTTTTAGATATTTTAATAGATGCTAATCG
GGTTTTTGAAATCGACGATCTATGGACAAATACCTTAGGCGGTCCTTTCGC 
CCTATGGAGTTATCGAAACATAAAAGGTTGGCTTCTAACTATTAGAAAATG 
A 



(SBSu^D W\ pi) 

MLKRLFTEDGELTKISRRFVWMLVVIYCLIIVRMCFGPQIMIEGVSTPNVQRFG
RIVALLVPFNSFRSLDQLTSFKEILWVIGQNVVNILLLFPLIIGLLSLKPSLRKYK
SVILLAFLMSLFIECTQVVLDILIDANRVFEIDDLWTNTLGGPFALWSYRNIKG
WLLTIRK* 



Sequence description: 

A] Length: 516 bp - 172 aa (full-length gene) 

B] ATG start codon is preceded by a Shine-
Dalgarno sequence. Possesses a potential leader
peptide sequence. C-terminus needs further
confirmation.



ID-118 



Clone 3-3 
(SE& 3V NO '• <?S) 

ATGAAAAAGCTTACTTTTATTTGGGATTTAGATGGGACATTAATAGATTCG 

TATGTACCAATTATGGAAGCTCTTGAAGAAACCTATCGTCATTTTGGCTTA 

ATATTTGATAAAGAATTAATCCATGAATATATTTTACAGGAATCAGTGGGG 

CAATTATTGGTAAACCTTTCAGAGGAAGAGCAAATACCTCATGAAAAACT 

GAAAGCATATTTTACAAAAGAACAAGAAAGTCGAGATTCTAAAATACATT 

TAATGCCATATGCAAAAGAGATTTTAGAATGGACCAAAGAACAAGATATT 

CCCAATTTTATGTATACACATAAAGGAGCAAGTACGCATTCAGTGTTGGAA 

ACCTTGCAGATCTCTCATTATTTTGATGAAATTTTAACTGGTGTTTCGGGAT 

TCGAGCGAAAACCACATCCACAAGGGATTAATTATTTAGTTAAACGATATT 

CTTTAGATAAATCAATGACTTATTACATAGGAGATCGTCCACTAGATTTGG 

AGGTTGCTCAAAATGCTGGTATAAAATCCATAAACTTAAGGTTAGAGAATT 

CCAAAGAAAACTATAATATTTCAAGTCTCAAAGATATAATATCACTTGATT 

TCACTCGTTTGGATTAA 

tseazo No'.fk) 

MKKLTFIWDLDGTLIDSYVPIMEALEETYRHFGLIFDKELIHEYILQESVGQLL 
VNLSEEEQIPHEKLKAYFTKEQESRDSKIHLMPYAKEILEWTKEQDIPNFMYTH 

KGASTHSVLETLQISHYFDEILTGVSGFERKPHPQGINYLVKRYSLDKSMTYYI 
GDRPLDLEVAQNAGIKSINLRLENSKENYNISSLKDIISLDFTRLD* 



Sequence description: 

A] Length: 627 bp - 209 aa (Possible Full-length gene) 



B] ATG start codon is preceded by a possible
Shine-Dalgarno sequence. No obvious leader
peptide sequence.




ID-119 



Clone 3-7 


I SESi 30 NO \ 

ATGGAAAAAGAAAAAAAATTAGGTCTTTTACCACTAACAATGCTTGTCATT 

GGCTCTCTTATCGGTGGCGGAATCTTTGATTTAATGCAAAATATGAGTTCC 

AGAGCCGGTTTGGTACCAATGCTTATTGCTTGGGTAATTACTGCTATCGGG 

ATGGGAACTTTCGTTTTAAGTTTTCAAAATTTATCTGAAAAAAGGCCGGAC
CTAACAGCTGGAATCTTTAGTTACGCTAAAGAGGGGTTTGGAAACTTTATG 
GGATTTAACTCTGCATGGGGTTATTGGTTATCAGCTTGGCTTGGAAATGTT 
GCCTACGCTGCACTCTTATTCAGTTCACTCGGTTATTTCTTTAAATTCTTTG 
GTAATGGAAATAATATCATCTCAATTATTGGAGCAAGTATAGTTATTTGGG 

TTGTCCATTTCTTAATTTTAAGAGGTGTTAATACAGCTGCATTTATTAATAC
CGTAGTTACCTTTGCAAAATTAGTACCTGTTATTATTTTCTTAATTTCAGCG 
TTATTAGCTTTCAAATTTAACATTTTTAGTCTTGATATCTGGGGAAATGGAT 
TACATCAATCAATTTTCAACCAAGTCAATTCAACTATGAAAACCGCTGTTT 
GGGTATTTATTGGTATTGAGGGCGCCGTTGTCTTCTCAGGTCGTGCTAAAA 

AACACTCTGATATTGGTAAAGCAAGTATCCTAGCATTATTCACTATGATTT
CACTTTATGTATTGATTTCTGTTTTATCACTTGGTATCATGTCACGTCCAGA 
ACTTGCAAACTTAAAAACACCAGCTATGGCTTACGTTCTAGAAAAAGCTGT 
TGGTCACTGGGGTGCTATCTTAGTTAACCTTGGTGTTATCATTTCAGTATTT 
GGCGCTATTCTTGCTTGGACTTTATTTGCAGCAGAATTACCATATCAAGCT 

GCTAAAGAAGGTGCTTTTCCTAAATTTTTTGCAAAAGAAAATAAAAACAA
AGCTCCAATCAACTCACTCTTAGTCACTAATCTTTGTGTACAAGCATTCTTA 
ATCACGTTCTTATTCACACAAAGTGCTTATCGTTTTGGTTTCGCATTAGCAT 
CATCTGCTATCTTAATTCCTTATGCTTTTACAGCACTATATCAATTACAATT 
CACACTCCGTGAGGATAAGTCAACTCCAGGACATCAAAAGAATTTAATTA 

TCGGTATCCTCGCTACAATCTATGCTGTTTACCTTATCTACGCTGGTGGTTT
TGATTACTTACTTTTGACAATGATTGCTTATACTCTAGGTATGATTCTCTAT 
ATTAAAATGAGAAAAGATGACAAGCTTGGCGTAATCATGGTCATAGCTGT 
TTCCAGTGTGAAATTGTTATCC 

40 (sebl hd m '. u) 

MEKEKKLGLLPLTMLVIGSLIGGGIFDLMQNMSSRAGLVPMLIAWVITAIGMG
TFVLSFQNLSEKRPDLTAGIFSYAKEGFGNFMGFNSAWGYWLSAWLGNVAY
AALLFSSLGYFFKFFGNGNNIISIIGASIVIWVVHFLILRGVNTAAFINTVVTFAK
LVPVIIFLISALLAFKFNIFSLDIWGNGLHQSIFNQVNSTMKTAVWVFIGIEGAV
VFSGRAKKHSDIGKASILALFTMISLYVLISVLSLGIMSRPELANLKTPAMAYV
LEKAVGHWGAILVNLGVIISVFGAILAWTLFAAELPYQAAKEGAFPKFFAKEN
KNKAPINSLLVTNLCVQAFLITFLFTQSAYRFGFALASSAILIPYAFTALYQLQF
TLREDKSTPGHQKNLIIGILATIYAVYLIYAGGFDYLLLTMIAYTLGMILYIKMR
KDDKLGVIMVIAVSSVKLLS



Sequence description: 


A] Length: 1356 bp - 452 aa (partial sequence) 

B] ATG start codon is preceded by a possible
Shine-Dalgarno sequence. Possesses a potential
leader peptide sequence.




ID-120

Clone 3-8 

csebl m>: ri) 

ATGAAATTTGAAAAACGGCAGGTCTATTATGTTGTCATAACATTTGCTATT 

TGCTATGCTATACAGGCTTATTGGGGAGCTGTTTCTAATATTTTAACTAGGC
TTCATAAGGCAATATTTCCTTTTTTGATGGGAGCTGGAATTGCCTATATTAT 
TAATATTGTAATGTCAGTCTATGAGCGATTATATATAAAGCTTTTTAAAGG 
ATCTAGACTATTAATGGCAATCAAGCGTAGTGTTTCTATGATTTTATCCTAT 
GCAACTTTTATTGGTTTAATTGTCTGGCTATTTTCAATTGTCATTCCAGATT 

TGATTTCTAGTTTGAGTTCTTTATTGGTTATTGATACCGGAGCACTTGCTAA
ATTGGTTAATAATCTCAATGAAAATAAACAAATTTCTGAGGCTTTAAATTA 
TATGGGAACAGATAAAGACTTAGTTTCTACTTTAAGTGGTTATAGCCAGCA 
GATTTTGAAGCAAGTTTTATCTGTTTTAACAAATTTACTAACCTCAGTTTCC 
TCTATTGCGGCAACACTTCTGAATGTTTTTGTTAGTTTTATTTTTTCAATTTA 

CGTTTTGGCAAACAAGGAGCAGTTGGGACGTCAATTTAATTTGTTAATTGA
TACCTATTTAGGTTCAACAGGCAAAACATTCCATTACGTTCGTCATATCCTT 
CATCAACGTTTCCATGGTTTTTTTGTAAGCCAAACTTTAGAAGCTATGATTT 
TAGGAAGTTTGACGGTTATTGGTATGTTGATCTTCCAATTTCCTTATGCTTT 
AACAGTTGGGGTTTTAGTTGCTTTTACAGCTCTAATACCGGTTGTGGGAGC 

CTACATTGGTGTTACAATCGGTTTCATCTTAATTGCTACTGAATCGCTTACT
GAAGCATTCTTGTTTGTTCTTTTCTTGATCCTTTTACAACAATTTGAGGGAA 
ATGTCATTTATCCGAAAGTTGTCGGTGGATCGATTGGACTGCCTTCTATGT 
GGGTTTTAATGGCTATTACTATCGGAGGTGCTTTATGGGGGATCTTAGGCA 



TGTTACTTGCTGTTCCTGTTGCAGCTACTATCTATCAGATTGTAAAAGATCA 
TATTATCAAGCGACAAACGCTTAGAAATCGTGCACGAACCTATCGTTAA 



MKFEKRQVYYVVITFAICYAIQAYWGAVSNILTTLHKAIFPFLMGAGIAYIINI
VMSVYERLYIKLFKGSRLLMAIKRSVSMILSYATFIGLIVWLFSIVIPDLISSLSS
LLVIDTGALAKLVNNLNENKQISEALNYMGTDKDLVSTLSGYSQQILKQVLSV 
LTNLLTSVSSIAATLLNVFVSFIFSIYVLANKEQLGRQFNLLIDTYLGSTGKTFH 
YVRHILHQRFHGFFVSQTLEAMILGSLTVIGMLIFQFPYALTVGVLVAFTALIP 
VVGAYIGVTIGFILIATESLTEAFLFVLFLILLQQFEGNVIYPKVVGGSIGLPSM 
WVLMAITIGGALWGILGMLLAVPVAATIYQIVKDHIIKRQTLRNRARTYR*



Sequence description: 

A] Length: 1134 bp - 378 aa (full-length gene)

B] ATG start codon is preceded by a typical
Shine-Dalgarno sequence. Possesses a potential 
leader peptide sequence. 






ID-121 



Identical to ID-68, as described in WO 00/06736




ID-122 


Clone 3-16 
(SEH TO NO'. fl) 

GTGATTACAATTAAAAAGGAATCTGTTATCAAACTATTGAAGTATGCTTTT 
35 GGCATTATAATGGGATTTATTATCTTAGCTATTGTAATAGGTGGGCTCCTA 
TTTGCATACTACGTTAGTCGTTCTCCGAAATTAACCGATCAAGCTTTAAAA 
TCCGTTAACTCTAGTTTGGTTTATGATGGTAATAATAAACTTATTGCCGATT 
TAGGCTCAGAAAAGCGTGAAAGTGTTAGTGCGGATAGCATTCCACTAAAT 
TTGGTTAAGGCTATCAGTTCTATAGAAGATAAACGTTTGTTTAAACATAGA
40 GGTGTCGATATTTATCGTATTTTAGGTGCAGCTTGGCATAACCTTGTTAGTA 
GTAATACGCAAGGTGGTTCAACCCTTGATCAACAGTTGATTAAACTGGCTT 
ACTTTTCTACCAATAAATCTGACCAAACGTTAAAACGTAAATCACAGGAA 
GTTTGGCTTGCGCTTCAAATGGAGCGTAAATACACCAAAGAAGAAATTCTT 
ACTTTCTATATTAATAAAGTTTATATGGGAAATGGGAATTATGGTATGAGA 



ACAACAGCTAAATCATACTTTGGTAAAGACCTAAAGGAATTATCTATTGCA 
CAACTTGCTTTGCTCGCTGGTATTCCTCAAGCACCTACACAATATGACCCTT 
ATAAAAACCCAGAATCTGCTCAAACAAGACGTAATACCGTTCTTCAGCAG 
ATGTATCAAGATAAAAACATTTCTAAAAAGGAATACGACCAAGCTGTTGC 
5 AACTCCAGTAACTGATGGCTTAAAAGAATTAAAGCAAAAATCTACTTATCC 
AAAATATATGGATAACTACTTAAAACAAGTTATTAGTGAAGTTAAACAAA 
AAACTGGTAAAGATATCTTTACTGCTGGGCTAAAAGTGTATACTAATATCA 
ACACTGATGCACAAAAACAACTATATGACATCTACAACAGTGATACTTAC 
ATCGCTTATCCAAACAATGAATTACAAATAGCATCTACCATCATGGATGCG 

ACTAATGGTAAAGTCATTGCACAATTAGGCGGGCGTCATCAGAATGAAAA
TATTTCATTTGGGACAAATCAATCTGTCTTAACAGACCGCGATTGGGGTTC 
TACAATGAAACCTATCTCAGCTTATGCACCTGCTATTGATAGTGGTGTCTA 
TAATTCAACAGGTCAATCATTAAACGACTCAGTTTACTACTGGCCTGGTAC 
TTCTACTCAACTATATGACTGGGATCGTCAATATATGGGTTGGATGAGTAT 

GCAGACCGCTATTCAACAATCACGTAACGTCCCTGCTGTCAGAGCACTTGA
AGCCGCTGGATTAGACGAAGCAAAATCTTTCCTTGAAAAATTAGGCATAT 
ACTATCCAGAAATG 

CSEQ ^0 w. e \£\ 

MmKKESVlia.LKYAFGIIMGFIILAIVIGGLLFAYYVSRSPKLTDQALKSVNSS 
20 LVYDGNNKLIADLGSEKRESVSADSIPLNLVNAITSffiDKRFFKHRGVDIYRILG 
AAWHNLVSSNTQGGSTLDQQLIKLAYFSTNKSDQTLKRKSQEVWLALQMER 
KYTKEEELTFYINKVYMGNGNYGMRTTAKSYFGKDLKELSIAQLALLAGIPQA 
PTQYDPYKNPESAQTRRNTVLQQMYQDKNISKKEYDQAVATPVTDGLKELK 
QKSTYPKYMDNYLKQVISEVKQKTGKDIFTAGLKVYTNINTDAQKQLYDIYN 
25 SDTYIAYPNNELQIASTIMDATNGKVIAQLGGRHQNENISFGTNQSVLTDRDW 
GSTMKPISAYAPAIDSGVYNSTGQSLNDSVYYWPGTSTQLYDWDRQYMGWM 
SMQTAIQQSRNVPAVRALEAAGLDEAKSFLEKLGIYYPEM 



Sequence description:

A] Length: 1386 bp - 462 aa (partial sequence)

B] GTG start codon is preceded by a
typical Shine-Dalgarno sequence. Possesses a
potential leader peptide sequence.



ID- 123 




Clone 3-17 



isekzzo no'- ^3) 

ATGGCTAATGTATATGATTTAGCAAATGAATTAGAACGTGCTGTTCGTGCT 
TTACCAGAATACCAAGCAGTTTTAACTGCAAAAGCAGCTATTGAAAATGA 
TGCGGATGCACAAGTGCTTTGGCAAGACTTTTTGGCTACCCAATCAAAAGT 
TCAAGAAATGATGCAATCTGGCCAAATGCCAAGTCAAGAAGAACAAGATG 
5 AAATGTCTAAACTTGGGGAAAAAATTGAATCCAATGACCTTTTAAAAGTTT 
ATTTTGACCAACAACAACGGTTGTCTGTCTATATGTCTGATATCGAAAAAA 
TTGTCTTTGCACCCATGCAGGACTTGATGTAA 

(SE<L^Dj^O\ in) 

MANVYDLANELERAVRALPEYQAVLTAKAAIENDADAQVLWQDFLATQSK 
10 VQEMMQSGQMPSQEEQDEMSKLGEKffiSNDLLKVYFDQQQRLSVYMSDIEKI 
VFAPMQDLM* 



Sequence description: 


A] Length: 336 bp - 112 aa (full length sequence)

B] ATG start codon is preceded by a
typical Shine-Dalgarno sequence. No obvious 
potential leader peptide sequence. 



ID- 124 


Clone 3-26 
(SBC 3D Wd' <\5) 

ATGGCAGAAATCACAGCTAAACTTGTAAAAGAATTGCGTGAAAAATCAGG 
30 TGCAGGCGTTATGGACGCTAAAAAAGCATTAGTAGAAACTGATGGTGACC 
TTGATAAAGCGATTGAATTACTTCGCGAAAAAGGTATGGCTAAAGCAGCT 
AAAAAAGCAGACCGTGTTGCTGCTGAAGGTTTAACAGGTGTTTATGTTGAT 
GGTAACGTTGCAGCAGTTATTGAAGTTAA 

35 MAElTAKLVKELREKSGAGVMDAKKALVETDGDLDKAIELLREKGMAKAAK 
KADRVAAEGLTGVYVDGNVAAVffiV 



Sequence description:


A] Length: 230 bp - 76 aa (partial sequence) 

B] ATG start codon is preceded by a
typical Shine-Dalgarno sequence. No obvious 
potential leader peptide sequence. 



ID- 125 




Clone 3-33 

( sea. J» NV \ 14) 

ATGATAAAAAACCTGTTATTAACAGGTTTTTTATCATTTAATGACGGAAAA
CTGGACACAAATTATTTTTGTTGTATAATTAAATATATTATTTCTTATCAGG 
AGGTTATGATGACATTAGAGAAACGATTTAA 

LSE6L DX> m\ 

MIKNLLLTGFLSFNDGKLDTNYFSCIIKYnSYQEVMMTLEKRF 




ID- 126 



Clone 3-41 


(SZB. J^o (id : ^ 

ATGAAAAATAATAAAAATAATGGTTTTCTGAAAAATTCCTTTATTTACATA 
TTATTGATTATTGCGGTTATTACAACCTTTCAATACTATTTAA 

35 MKNNKNNGFLKNSFIYILLIIAVrrTFQYYL 






Sequence description: 






A] Length: 134 bp - 44 aa (partial sequence) 

B] ATG start codon is preceded by a
typical Shine-Dalgarno sequence. Possible 
potential leader peptide sequence. 



Sequence description: 






A] Length: 94 bp - 31 aa (partial sequence) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 



ID- 127 



Clone 3-42 
ISEQl -xo ttd'> I o 0 

ATGTTAGATATTATCTTATCCGGAATTTCGCAAGGATTACTTTGGTCAATTA 
TGGCAATTGGCGTGTTTATCACTTTTCGTATCTTAGACATAGCCGATCTCTC
TGCAGAAGGGGCTTTCCCTATGGGGGCTGCAGTTTGCGCCTTATGTATCGT 
TAA 

CSEQ. zw r*o'. lap) 

MLDIILSGISQGLLWSIMAIGVFITFRILDIADLSAEGAFPMGAAVCALCrV 












Sequence description: 



A] Length: 158 bp - 52 aa (partial sequence) 
B] ATG start codon is preceded by a

possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. 



ID-128 



Clone 3-43 



(SBQ. -50 NO'. 10$) 

ATGGAAATGCCTAAAAGAAATGAATTACTCAATAAAGAAATTAAAATGAG 
TATTGATAAACTTAGATATAAAGAACCAGAGAGTGAACATGACAAGCGAC 
CTACTTTTTATTTGGTAGTACTTATACTTGTTACTGTAGCAGTTATATTGTC 
GTTATTTAA

( 5EGL 3.0 NO ! )0h) 

MEMPKRNELLNKEIKMSIDKLRYKEPESEHDKRPTFYLVVLILVTVAVILSLF 



Sequence description:



A] Length: 161 bp - 53 aa (full-length gene) 

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 



leader peptide sequence. 



ID- 129 


Clone 3-44 

(SEQ ID NO: 105)

GTGGTAAGTAAATTGAGTTTAACAACGATTTTTGCATTGCTATTTTCATCA
ATGCTAATTTACGCAACACCTCTTATCTTTACAAGTATTGGGGGAACCTTC 
TCTGAACGTGGTGGTATCGTCAACGTTGGTTTAGAAGGAATTATGGTAATT 
GGAGCTTTCTCAGGCGTTGTATTTAA 

MVSKLSLTTIFALLFSSMLIYATPLIFTSIGGTFSERGGIVNVGLEGIMVIGAFSG
VVF 



Sequence description:

A] Length: 179 bp - 59 aa (partial sequence)

B] GTG start codon is preceded by a
possible Shine-Dalgarno sequence. Potential
leader peptide sequence.



ID- 130 






Clone 3-46/47 



(SB k ZEO fiO\ Wit) 

ATGAGAATTATtGCAATAACTGAAAAGGTTATAAAAGAACTGTTTCGTGAT 
35 AAAAGAACACTTGCTATGATGTTTTTAGCACCTATTTTAATTATGTTTTTGA 
TGAATGTTATGTTTTCTGCGAATAGTAATACAAAAGTTAAGATTGGAACTA 
TTAACGTTAACACGAAGGTCGTTTCAAATTTAGATAATATTAAGCATATTC 
AAGTGAGATCATTTAAATTTAACTCATCTGCTAAAAAAGCACTCAAATCAA 
ATAAAATTGATGCTCTTATTTCGGAGGACAATAAATCTTATACTGTCTTCT 
40 ATGCGAATACAGATTCTTCAAAGACGACTTTAACAAGACAAGCTTTTAAA 
ACCGCTGTTAATACAATGAACAGTAAGGAACTGATTTCGCAAGTTAAAATT 
TTAGCTAATAAGAATCCGAAACTAGCACAATCCTTACAAACTCGCTCCAAA 
TATATCAAAGAAAAATATAATTACGGAAATAAAAATACAGGCTTTTTTGC 
AAAAATGATACCAATACTAATGGGATTTATGGTCTTCTTCTTGGTTTTT 



CSEtiL m '• I OS ) 

MRIIAITEKVIKELFRDKRTLAMMFLAPILIMFLMNVMFSANSNTKVKIGTINV 
NTKVVSNLDNIKHIQVRSFKFNSSAKKALKSNKIDALISEDNKSYTVFYANTDS 
SKTTLTRQAFKTAVNTMNSKELISQVKILANKNPKLAQSLQTRSKYIKEKYNY 
GNKNTGFFAKMIPIIMGFMVFFLVF



Sequence description: 

A] Length: 558 bp - 186 aa (partial sequence)

B] ATG start codon is preceded by a 
possible Shine-Dalgarno sequence. Potential 
leader peptide sequence. C-terminus has yet to be 
determined. 



ID-131 


Clone 3-48 
CSEGL IPO fiO ' 104) 

gtGattatcgttatgagtaaacatcaagaaattttggagtacctagaaaat 
25 ttagctgttggtaagagggttagtgtacgcagtatttcaaatcatttaa 

(SEEL ~H> WO ! }\o) 

MIIVMSKHQEILEYLENLAVGKRVSVRSISNHL 

Sequence description:

A] Length: 100 bp - 33 aa (partial sequence)

B] GTG start codon is not preceded by an
obvious Shine-Dalgarno sequence. No obvious
leader peptide sequence.



ID- 132 




Clone 2-c53 



AT^G^A^GAGAAATTACCGCTGTCGAACACGATCGCTTTGTGAGCGAATCC 
AACCAAACAAACCTACTTCAATCTCTTAATTGGCCCAAAGTAAAAGACAA 
CTGGGGTAGTCAATTACTTGGCTTTTTTGACGGTGAAACCCAAATTGCCAG 
CGCTAGTATTCTCATCAAATCACTTCCTCTTGGCTTCTCCATGCTGTATATT 
5 CCGCGTGGACCAATCATGGATTACTCCAATCTAGATATTGTAACTAAGGTC 
CTTAAGGACCTTAAAGCTTTTGGCAAAAAACAAAGAGCTCTCTTTATCAAG 
TGTGATCCTCTCATCTATTT 

CSE&. -xo tib \ ua) 

MYREITAVEHDRFVSESNQTNLLQSLNWPKVKDNWGSQLLGFFDGETQIASA 
1 0 SHJKSLPLGFSMLYIPRGPIMD YSNLDIVTKVLKDLKAFGKKQRALFIKCDPLI 
Y 



Sequence description:

A] Length: 326 bp - 108 aa (partial sequence)

B] ATG start codon is preceded by an obvious
Shine-Dalgarno sequence. No obvious leader
peptide sequence.



ID-133



Clone 2-c59 

ATGGACAAGAAAAAAATCTTAGTAACGGGTATTGTGCCTAAAGAAGGTCT 
AAGAAAGCTTATGGACCGATTTGATGTTACTTATTCAGAAGATCGCCCATT 
TTCACGTGACTATGTGTTAGAGCATTTATCTGAATATGACGGATGGTTACT 
CATGGGACAAAAAGGTGATAAAGAGATGATTGATGCAGGTGAAAACTTAC 
35 AAATTATTTCTTT 

(SEEL Z£Q <V0'. /)W) 

MDKKKILVTGIVPKEGLRKLMDRFDVTYSEDRPFSRDYVLEHLSEYDGWLLM 
GQKGDKEMIDAGENLQIIS 




Sequence description: 



A] Length: 215 bp - 71 aa (partial sequence) 



B] ATG start codon is preceded by an obvious 
Shine-Dalgarno sequence. No obvious leader 
peptide sequence. 


ID- 134 



Clone 2-c62

(SEQ ID NO: 115)

ATTTCGAAAGATGACTACCAAAATATTAGTTTTGGACAGGATCCAGAAGTT 
GTTGATTATGCTGGTCTGTTTGAAAAACGCCGTCCAGTTTTAGAAAAAGCA 

GTTAAAAATTTCTTGCAAGAAGAGAGAGCTACGAGAATGCTATCTGATTTC
TTGCAAGAAGAAAAATGGGTAACTGATTTTGCTGAATTTATGGCGATCAA 
AGAACATTTTGGTAATAAGGCGCTTCAAGAATGGGATGACAAGGCTATTA 
TACGCCGCGAAGAAGAAGCCTTAGCAGGATATCGTCAAAAGCTTAGTGAA 
GTGATAAAATATCATGAAGTAACGCAATATTTCTTTTACAAACAATGGTTT 

20 GAGTTAAAAGAATATGCTAATGATAAAGGGATTCAAATTATCGGTGATAT 
GCCAATCTACGTTTCTGCCGATAGTGTAGAAGTTTGGACAATGCCTGAACT 
GTTT 

CSGL^O NO'. 

ISKDDYQNISFGQDPEVVDYAGLFEKRRPVLEKAVKNFLQEERATRMLSDFLQ 
25 EEKWVTDFAEFMAIKEHFGNKALQEWDDKAIIRREEEALAGYRQKLSEVIKY 
HEVTQYFFYKQWFELKEYANDKGIQIIGDMPIYVSADSVEVWTMPELF 

A] Length: 459 bp - 153 aa (partial sequence) 
B] More sequencing is required to determine the
N- and C-termini

enzyme). - Streptococcus pneumoniae (63%) 

ID-135 


Identical to ID- 108 described in WO 00/06736 



Clone 2-c63 


ID- 136 



Clone 2-c66 



CseA-zow. in} 

ATGGCAAAACAGAAAAATAACTGGCGCCGTGTTGGAGTTGGTGTCCTTAC 
ACTTGCTTCAGTTGCGACTCTTGCTGCATGTGGAAGTAAATCAGCTTCCCA 
5 GGATTCTAATGGAGCGATTAATTGGGCTATTCCAACAGAAATCAATACACT 
AGATTTATCTAAAGTTACAGACACTTACTCAAATCTAGCTATTGGTAACTC 
TAGTAGTAATTTCCTTCGCTTAGATAAAGATGGAAAGACAAGACCAGACTT 
GGCTACTAAAGTTGATGTTTCAAAAGATGGCTTAACTTATACAGCTACATT 
ACGTAAAGGCTTGAAGTGGTCAGATGGCAGTAAACTTACTGCAAAGGATT 

TTGTTTATTCATGGCAACGTTTAGTTGATCCTAAAACAGCTTCACAATATG
CTTACCTTGCTGTTGAAGGGCATGTGCTTAATGCCGATAAAATCAACGAAG 
GACAAGAGAAAGACTTGAATAAGCTAGGTGTTAAGGCAGAAGGCGATGA 
CAAAGTTGTTATTACTTTATCTAGTCCGTCTCCGCAATTCATCTACTACCTT 
GCATTCACTAACTTCATGCCACAAAAACAAGAAGTTGTTGAAAAATATGG 

AAAAGATTACGCAACTACTTCAAAAAATACAGTTTACTCAGGACCATATA
CTGTTGAAGGTTGGAATGGTTCGAATGGTACTTTCACGCTGAAGAAAAAC 
AAAAATTATTGGGACGCTAAAAATGTAAAAACAAAAGAAGTTCGCATCCA 
GACTGTTAAAAAACCAGATACCGCCGTTCAAATGTATAAACGTGGTGAGT 
TAGATGCAGCTAATATCTCAAATACTTCTGCTATTTATCAAGCTAATAAAA 

20 ATAATAAAGATGTCACAGATGTTCTAGAAGCGACCACTGCCTATATGGAA 
TATAATACTACTGGTTCTGTGAAAGGGCTTGATAATGTTAAGATTCGTCGC 
GCCTTAAACTTAGCAACTAACCGTAAAGGAGTTGTTCAAGCAGCCGTTGAT 
ACAGGCTCAAAACCGGCAATTGCTTTTGCACCTACTGGTTTAGCCAAAACA 
CCAGATGGAACTGATTTGGCAAAATATGTTGCCCCAGGTTATGAATATAAT 

AAAACTGAAGCAGCAAAACTCTTTAGACTA

ise& -xd no: ut) 

MAKQKNNWRRVGVGVLTLASVATLAACGSKSASQDSNGAHWAIPTEINTLD 
LSKVTDTYSNLAIGNSSSNFLRLDKDGKTRPDLATKVDVSKDGLTYTATLRKG 
LKWSDGSKLTAKDFVYSWQRLVDPKTASQYAYLAVEGHVLNADKINEGQEK 
DLNKLGVKAEGDDKVVITLSSPSPQFIYYLAFTNFMPQKQEVVEKYGKDYAT
TSKNTVYSGPYTVEGWNGSNGTFTLKKNKNYWDAKNVKTKEVRIQTVKKPD 
TAVQMYKRGELDAANISNTSAIYQANKNNKDVTDVLEATTAYMEYNTTGSV 
KGLDNVKIRRALNLATNRKGVVQAAVDTGSKPAIAFAPTGLAKTPDGTDLAK 
YVAPGYEYNKTEAAKLFRL 


Sequence description: 






A] Length: 1143 bp - 381 aa (partial sequence)

B] Shine-Dalgarno sequence precedes ATG codon. 
Possesses a potential leader peptide sequence. 



ID- 137 



Clone 2-c67 

(5ECl HO NO'. Ill) 

TTGAGAGTTTATGAAAATAAAGAAGAGTTGAAAAAAGAAATAAGTAAAAC 
ATTTGAGAAATACATTATGGAATTTAATAA 

TATTCCAGAGAATCTAAAAGATAAAAGAATTGATGAAGTTGATAGAACTC 
CAGCAGAAAACCTTTCTTATCAGGTTGGCT 

GGACCAACTTGGTTCTTAAATGGGAAGAAGATGAAAGAAAGGGACTTCAA 
GTAAAAACACCATCGGATAAATTT 



MRVYENKEELKKEISKTFEKYIMEFNNIPENLKDKRIDEVDRTPAENLSYQVG 
WTNLVLKWEEDERKGLQVKTPSDKF 

Sequence description

A] Length: 234 bp - 78 aa (partial sequence)

B] TTG start codon is preceded by a
potential Shine-Dalgarno sequence. No obvious
leader peptide sequence.



ID-138



Clone 2-c70
tSEk ZPo fiOl 13))

ATGTCAAAGTTTGATAGTCAGAAAATAATTACTCCGATTATGAAGTTTGTC

AATATGCGAGGGATTATTGCACTCAAAGATGGCATGCTAGCAATTTTACCA

CTAACAGTTGTTGGGAGTCTCTTTTTAATATTAGGGCAGCTTCCATTT

MSKFDSQKIITPIMKFVNMRGIIALKDGMLAILPLTVVGSLFLILGQLPF



Sequence description

A] Length: 150 bp - 50 aa (partial sequence)

B] ATG start codon is preceded by a potential
Shine-Dalgarno sequence. Possesses a potential
leader peptide sequence



ID- 139 


Clone 2-c71 

/SESl ^0 NO : IPS) 

GAGACCACTTCATCAGTTAAACCAGCAGGAATTGACCGTATCAATCATACC
TCAACACCCCCGAAGAAAACTACCCCCAACATTGCAACGACGCATAGCTT 
CAAAGATCGTTGTGATACTTTAGAAAGAATTCACAATGAAGACATTGATGT 
TTGTTCTGGATTCATTTGTGGTATGGGAGAGAGCGATGAGGGGCTCATCAC 
ATTAGCTTTCAGACTAAAAGAACTGAACCCCTATTCTATCCCTGTCAATTTT 

TTACTTGCTGTTGAAGGAACACCTCTTGGAAAATATAACTATTTGACTCCC
ATTAAATGCTTAAAAATTATGGCCATGTTGCGTTTTGTTTTTCCTTTCAAGG 
AATTAAGATTAAGTGCTGGACGGGAGGTCCATTTTGAGAATTTTGAATCAT 
TAGTCACCTTACTTGTTGACTCAACTTTTTTGGGAAATTACCTAACAGAGG 
GGGGTCGCAATCAACATACCGATATTGAATTCTTGGAAAAATTACAACTA 

AATCATACTAAAAAGGAATTAATTT

CseBl 1.V m'. oh) 

ETTSSVKPAGIDRINHTSTPPKKTTPNIATTHSFKDRCDTLERIHNEDIDVCSGFI 
CGMGESDEGLITLAFRLKELNPYSIPVNFLLAVEGTPLGKYNYLTPIKCLKEMA 
MLRFVFPFKELRLSAGREVHFENFESLVTLLVDSTFLGNYLTEGGRNQHTDBEF 
LEKLQLNHTKKELI



Sequence description: 


A] Length: 535 bp - 178 aa (partial sequence) 

B] N- and C-termini require verification 


ID- 140 



Clone 2-c73


i sm Xi> tod '■ IPS) 

ATGCCGGTTTGGACTGCACAGTCTATTCCAAAGGCATTTTTAGAAAAGCAT 

AATACTAAGGAAGGCACCTGGGCAAAACTAACCATTCTAAGTGGTTCTTTA 

GTATTTTACCAGTTATCTCCTGATGGAGAGGAAATCTCGCGGCATATTTTT 



GATGCTAGTAGTGATATTCCTTTTGTTGATCCACAAGTCTGGCATAAAGTT 

TCGCCGAATAGTCCAGACTTAAGTTGCTATCTAACTTTTTACTGCCAAAAA 

GAAGATTACTTCCATAAAAAATATGGTCTCACGCGCACACATTCTGAGGTT 

ATCGCCAGTGCACCTCTCTTATCTGAGAAGAGTAATATATTAGACCTTGGG 

TGTGGTCAAGGGCGAAACTCACTTTATTTATCGCTGCTGGGACATCAAGTG 

ACTTCTGTCGATTCAAACGGACAGAGCCTTGTAGCTTTAGAAAATATGGCA 

TTAGAAGAAGAGCTTCCTTACAATATAAAAAGGTATGATATTAATACTACT 

GCTATTGAAGGGCACTATGATTTTATTTTATCAACTGTGGTATTTATGTTTT 

T 

i sea 3V /»<)'. m\ 

MPVWTAQSIPKAFLEKHNTKEGTWAKLTILSGSLVFYQLSPDGEEISRHIFDAS 
SDIPFVDPQVWHKVSPNSPDLSCYLTFYCQKEDYFHKKYGLTRTHSEVIASAP 
LLSEKSNILDLGCGQGRNSLYLSLLGHQVTSVDSNGQSLVALENMALEEELPY 
NIKRYDINTTAIEGHYDFILSTVVFMF 



Sequence description: 



A] Length: 563 bp - 187 aa (partial sequence) 

B] N- and C-termini require verification 



ID-141 



Clone 2c76 

i sea. rva • ol) 

ATGACAAAGCAAATAATTGCCATTTGGGCTGAAGATGAAGACCATTTGAT 

TGGAGTTAATGGCGGTTTACCATGGAGGCTTCCTAAAGAGTTACATCACTT 

CAAAGAAACGACCATGGGGCAGGCTTTGCTTATGGGACGAAAGACCTTTG 

ATGGAATGAACCGTCGTGTTTTACCTGGTAGAGAGACAATCATCTTAACAA 

AAGATGAACAATTCCAAGCAGATGGAGTGACAGTCCTAAATAGTGTTGAA 

CAAGTTATAAAATGGTTTCAGGAACATAATAAGACCTTATTTATTGTAGGT 

GGTGCAAGTATTTATAAAGCATTTCTGCCTTATTGTGAAGCAATCATAAAA 

ACTAAAGTTCATGGAAAATTCAAAGGTGATACCTATTTTCCTGATGTTAAT 

CTATCTGAGTTT 

MTKQIIAIWAEDEDHLIGVNGGLPWRLPKELHHFKETTMGQALLMGRKTFDG 

MNRRVLPGRETIILTKDEQFQADGVTVLNSVEQVIKWFQEHNKTLFIVGGASI 

YKAFLPYCEAIIKTKVHGKFKGDTYFPDVNLSEF 



Sequence description 






A] Length: 417 bp - 139 aa (partial sequence) 

B] ATG start codon is preceded by a Shine- 
Dalgarno sequence. No leader peptide sequence 



ID- 142



Clone 2-c78 

15 CS&L ZW NO • 

TTGTGGCCAAACTGTGCCCCGCTTATTAATAGCACTTTGTTCACCATTGAA 

GATATCTTAACATCAGGTGCTCATAGCAACCCTATTTTAATGGGGGTTATA 

CTTGGCGGGACAATTGTAGTAGTGGCGACAGCACCACTTTCTTCTATGGCA 

TTGACAGCTATGCTAGGATTAACCGGAATGCCTATGGCTATAGGAGCCTTG 

20 TCTGTCTTTGGTTCGTCATTTATGAATGGTGTACTTTTCCATAAATTAAAAC 
TTGGAAGTCGTAAAGATAATATAGCTTTTGCTGTTGAGCCTCTAACTCAAG 
CTGACGTGACTTCAGCTAACCCTATTCCAATCTATGTCACTAATTTTGTTGG 
TGGTGCAGCTTGTGGTATTTTAATTGCCTTGATGAAATTAGTTAATGATACT 
CCTGGAACAGCGACACCAATTGCAGGATTTGCTGTCATGTTTGCCTATAAC 

25 CCAATGATAAAAGTACTAATAACCGCTCTAGGTTGTATTATCCTATCTTTA 
CTAGCAGGCTATTTTGGAGGCATTGTTTTT 

LSBQl :££> ak>: i3£>3 

MWPNCAPLINSTLFTIEDILTSGAHSNPILMGVILGGTIVVVATAPLSSMALTA 
MLGLTGMPMAIGALSVFGSSFMNGVLFHKLKLGSRKDNIAFAVEPLTQADVT 
30 SANPIPIYVTNFVGGAACGILIALMKLVNDTPGTATPIAGFAVMFAYNPMIKVL 
rTALGCIILSLLAGYFGGrVF 



Sequence description: 



A] Length: 540 bp - 180 aa (partial sequence) 

B] N- and C-termini have yet to be elucidated 






ID- 143 



Clone 2-c80 



ATGTTTTTAAGTATAATGGCAGGTGTCATAGCATTTGTCCTGACAGTTATT 
5 GCCATTCCACGCTTCATTAAGTTTTACCAATTGAAGAAAATTGGCGGGCAA 
CAAATGCATGAAGATGTCAAACAACATCTAGCCAAAGCAGGTACGCCGAC 
AATGGGAGGAACGGTATTTT 

tSEQ. -P0 

MFLSIMAGVtAFVLTVIAIPRFIKFYQLKKIGGQQMHEDVKQHLAKAGTPTMG 
GTVF



Sequence description: 


A] Length: 172 bp - 57 aa (partial sequence) 

B] Shine Dalgarno sequence precedes 'ATG' start 
codon. Possesses a potential leader peptide 
sequence. 




ID- 144 


Clone 3-83 
LSBGLXS) NO', i33) 

ATGAAACCATATTTATCTTTTATTGGTAGAACGTTATTATACTTCGGTATTT 
30 TATTGTTACTAATTTACTTTTTTGCATACCTTGGTCGCGGACAAGGCAGTTT 
TATTTATAA 

MKPYLSFIGRTLLYFGILLLLIYFFAYLGRGQGSFIY 


Sequence description: 

A] Length: 113 bp - 37 aa (partial sequence)

B] Putative ATG start codon is preceded by a
typical Shine-Dalgarno sequence. Possesses a
potential leader peptide sequence.
This ORF is not in frame with nuc



ID- 145 




Clone 3-86 
(SESLJQ W. IZS> 

ATGTCATATTTTAGAAATTACTGGTATCGTTTTGGAGCAATTTTATTTATTA 
TTTTAGCAGTAATATTGCTTGTTTTTAGACCTGACTGGTCAATGCTTCACTA
TCTATTGTATTTTTACTTTATGGCACTTCTAGCGCATCAATTTGAAGAATAT 
CAGTTTCCCGGTGGGGCATCACCTATCATTAACTATGTTGTTTATGATGAA 
GAAGAGCTGATGGATTGTTTTCCAGGCAATACTCAGTCTATTATGTTGGTT 
AATACTATTGCTTGGTTGCTTTACATTGCTAGTATTGCTTTTCCTCAAGCTT 
ATTGGCTTGGATTAGGAGTCATGTTCTTTAGTCTAACGCAGCTCTTGGGTC
ATGGTTTTCAGATGAATATTAAACTTAAAACTTGGTATAATCCTGGTCTAG 
CAACGACAGTATTTCTCCTAGTACCAATAGCTTGCGCATACATCTATCAAG 
CTAGTGCAGAAGGAATGCTCACTTGGGGAGATTGGCTAGGTGGTTTTATCA 
TGTTGATTGTCTGTGTACTAACTAGCATTATTGCACCTGTACAGCTATTGAA 
20 GGATAAGGAGACCAATTATATTATTAGTCCTTGGCAAATGGACCGTTTTCA 
TAAGGTCGTTAATTTTGTAAGGATAAAAAAATAA 

CS£EL -JiD NO'. \3Jp) 

MSYFRNYWYRFGAILFIILAVILLVFRPDWSMLHYLLYFYFMALLAHQFEEYQ 
FPGGASPIINYVVYDEEELMDCFPGNTQSIMLVNTIAWLLYIASIAFPQAYWLG 
25 LGVMFFSLTQLLGHGFQMNIKLKTWYNPGLATTVFLLVPIACAYIYQASAEG 
MLTWGDWLGGFIMLIVCVLTSIIAPVQLLKDKETNYIISPWQMDRFHKVVNFV 
RIKK* 



Sequence description:

A] Length: 651 bp - 219 aa (full length gene)

B] Putative ATG start codon is preceded by a
typical Shine-Dalgarno sequence. Possesses a
potential leader peptide sequence.



ID- 146



Clone 3-c88 



ATGCCACTTACAGCACTTGAAATTAAAGATAAAACATTTTCATCAAAATTT 
CGCGGTTATAGCGAAGAAGAAGTT 

MPLTALEIKDKTFSSKFRGYSEEEV



Sequence description: 


A] Length: 75 bp - 25 aa (partial sequence) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide 
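
The pairing of each nucleotide sequence with its deduced amino acid sequence, and the "bp - aa" lengths given in each sequence description, can be checked by a simple translation. The following is a minimal sketch only, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE`, `translate`, and `alt_start` are illustrative and are not part of the original filing. It uses the Clone 3-c88 sequence listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, alt_start=False):
    """Translate DNA in frame 1; a trailing partial codon is ignored.

    alt_start=True reports the first codon as M, which is how the listing
    shows GTG and TTG start codons (an assumption made for this sketch).
    """
    dna = "".join(dna.split()).upper()   # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3     # whole codons only
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if alt_start and protein:
        protein[0] = "M"
    return "".join(protein)


# Clone 3-c88 (ID-146), copied from the listing above.
CLONE_3_C88_DNA = """
ATGCCACTTACAGCACTTGAAATTAAAGATAAAACATTTTCATCAAAATTT
CGCGGTTATAGCGAAGAAGAAGTT
"""

protein = translate(CLONE_3_C88_DNA)
print(len("".join(CLONE_3_C88_DNA.split())), "bp -", len(protein), "aa")
print(protein)  # MPLTALEIKDKTFSSKFRGYSEEEV
```

For this clone the result agrees with the listed 75 bp - 25 aa. Partial sequences whose length is not a multiple of three leave one or two unpaired bases at the end, which the sketch drops.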



ID- 147 



Clone 3-90

7 sea ~xo *>d\ 13*1) 

ATGTCACTTTTTCAAGAAAAAATTGCTTACAATTGCGCTAAAAAGGAAGCG 
CTTTATAAAGAGAGTTTAGGACGCTACGCCTTGAGATCAATGCTAGCAGG 

25 GGCTTATTTGACAATGAGTACTGCTGCCGGTATCGTCGCAGCTGATACTAT 
TGGTAAAATTTCTCCTGCTCTATCAGGTTTTGTATTTGCTTTCATCTTTAGTT 
TTGGACTTATTTATGTTTTAATATTTAATGGTGAATTGGCGACATCTAATAT 
GCTTTATCTCACTGCAGGAGCCTATAATAAAAATATCTCTTGGAAAAAAGC 
CATAACAATTTTAATTTATTGTACTTTTTTCAACCTCGTTGGTGCTTGTATA 

TTAGCTTGGTTGTTTAA

(sm 3$ mo* no) 

MSLFQEKIAYNCAKKEALYKESLGRYALRSMLAGAYLTMSTAAGIVAADTIG 
KISPALSGFVFAFIFSFGLIYVLIFNGELATSNMLYLTAGAYNKNISWKKAiTILI 
YCTFFNLVGACILAWLF 


Sequence description 



A] Length: 406 bp - 125 aa (partial sequence)

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. Possible 
leader peptide 



ID- 148 


Clone 3-92 

AAGTTACAAGCGACTGAAGTTAAGAGCGTTCCGGTAGCACAACCAGCTTC 
AACAACAAATGCAGTAGCTGCACATCCTGAAAATGCAGGGCTCCAACCTC
ATGTTGCAGCTTATAAAGAAAAAGTAGCGTCAACTTATGGAGTTAATGAA 
TTCAGTACATACCGTGCGGGAGATCCAGGTGATCATGGTAAAGGTTTAGC 
AGTTGACTTTATTGTAGGTAAAAACCAAGCACTTGGTAATGAAGTTGCACA 
GTACTCTACACAAAATATGGCAGCAAATAACATTTCATATGTTATCTGGCA 
ACAAAAGTTTTATTCAAATACAAATAGTATTTATGGACCTGCTAATACTTG
GAATGCAATGCCAGATCGTGGTGGCGTTACTGCCAACCACTATGACCACGT 
TCACGTATCATTTAA 

( S£9l JZ> NO '■ NT) 

KLQATEVKSVPVAQPASTTNAVAAHPENAGLQPHVAAYKEKVASTYGVNEF 
20 STYRAGDPGDHGKGLAVDFIVGKNQALGNEVAQYSTQNMAANNISYVIWQQ 
KFYSNTNSIYGPANTWNAMPDRGGVTANHYDHVHVSF 









Sequence description 



A] Length: 419 bp - 139 aa (partial sequence) 

B] N- and C-termini have yet to be determined 



ID- 149 



Clone 3-94

(SESi TO no: /WS) 

ATGATTCCAGTAGTTATTGAACAAACAAGTCGTGGTGAACGTTCTTATGAT 
ATTTACTCACGTCTTTTAAAAGATCGTATTATTATGTTGACAGGCCAAGTT 
40 GAGGATAATATGGCCAATAGTATCATTGCACAGTTATTGTTTCTCGATGCA 
CAAGATAATACAAAGGATATTTACCTTTATGTCAATACACCAGGTGGTTCA 
GTATCGGCTGGACTTGCTATTGTGGACACCATGAACTTCATTAAATCGGAC 
GTACAGACGATTGTTATGGGGATGGCTGCTTCGATGGGAACCATTATTGCT 
TCAAGTGGTGCTAAAGGAAAACGTTTTATGTTACCGAATGCAGAATATATG 



ATCCACCAACCAATGGGCGGAACAGGCGGAGGTACACAGCAATCTGATAT 

GGCTATCGCTGCTGAGCATCTTTTAAAAACGCGTCATACTTTAGAAAAAAT 

CTTAGCTGATAATTCTGGTCAATCTATTGAAAAAGTCCATGATGATGCAGA 

GCGTGATCGTTGGATGAGTGCTCAAGAACACTTGATTATGGCTTTATTGAT 

GCTATTATGGAAAATAATAATTTACAATAATAGATTTAAAAGAGTTGAGTT 

TACCAACTCTTTTTTTATTTGTTGGAATTATGTTATAATCTTAGTAATTACA 

GATATGACGCAGAAAGGAAAAAATTATTGA 

( se<a zzo A)fl: 

MIPVVIEQTSRGERSYDIYSRLLKDRIIMLTGQVEDNMANSIIAQLLFLDAQDN 

TKDIYLYVNTPGGSVSAGLAIVDTMNFIKSDVQTIVMGMAASMGTIIASSGAK 

GKRFMLPNAEYMfflQPMGGTGGGTQQSDMAIAAEHLLKTRHTLEKILADNSG 

QSIEKVHDDAERDRWMSAQEHLIMALLMLLWKniYNNRFKRVEFTNSFFICW 

NYVIILVITDMTQKGKNY* 



Sequence description 

A] Length: 693 bp - 231 aa (full length gene) 

B] Putative ATG start codon is preceded by a 
typical Shine-Dalgarno sequence. No leader 
peptide. Significantly, it would appear to have a 
very hydrophobic C-terminus. 



ID- 150 



Clone 2-c86 
(SERXO MO'- 

ATGAAACCAAAAaTTATTGGTGTACTTGGTCTAGGAATATTTGGACAAACA 
CTCGCACAAGAACTAAGTAACTTTGAACAAGATGTTATTGCTATTGACAGC 
AATCCTGAAAATGTACAAGCTGTCGCCGAAGT 

TGTTACAAAAGCAGCTATCGGAGACATTACTGATTTAGCTTTCCTAAAACA 

CATCGGGATCAGTGACTGTGATACTGTTATTATTGCTACAGGAAACAGTTT 

AGAGAGCTCAGTATTGGCCGTAATGCACTGTAAAAAGTTAGGCGTCCCAC 

AAGTTATTGCTAAAGCTCGAAACCTTGTATACGAAGAAGTACTTTATGAAA 

TTGGTGCTGATTTGGTTATCTCTGCGGAGCGAGAATCTGGGGAAAATGTTG 

CTGCAAACCTCATGAGAAATAAAATTACAGATGTCTTCCAGATTGAATCTG 

ATATTTCTGTCATTGAATTT 

I SEk. TO m'. I ^(t>) 

MKPKIIGVLGLGIFGQTLAQELSNFEQDVIAIDSNPENVQAVAEVVTKAAIGDI 
TDLAFLKHIGISDCDTVIIATGNSLE 



SSVLAVMHCKKLGVPQVIAKARNLVYEEVLYEIGADLVISPERESGQNVAAN 
LMRNKITDVFQIESDISVIEF



Sequence description:

A] Length: 459 bp - 153 aa (partial sequence)

B] Putative ATG start codon is preceded by a
typical Shine-Dalgarno sequence. Possesses a
potential leader peptide sequence.
This ORF is not in frame with nuc




ID-151 



Clone 2-c88 


ISESl j:o W. Hid 

GTGCGTTATAGTAAAGAGATTATTCAGTTAGCTATACCAGCTATGATTGAA 
AATATCTTACAAATGCTCATGGGAGTAGTTGATAATTATCTAGTGGCTCAG 
TTAGGTGTTGTAGCAGTATCAGGTGTTTCAGTTGCTAATAATATAATTACT 
25 ATTTATCAAGCTATTTTTATAGCTTTAGGGGCGAGTATAGCAAGTCTATTG 
GCCAAGTCGTTAGCAGGTAGTGAGAAGGATGATGCAATTTCAGTATGTTCT 
CAAGCCATTTTTCTAACATCACTGATAGGGGCAGTATTAGGAATTATCTCG 
ATTGTTTTTGGACAAACTTTCTTT 

30 MRYSKEnQLAIPAMIENILQMLMGVVDNYLVAQLGVVAVSGVSVANNirriY 
QAIFIALGASIASLLAKSLAGSEKDDAISVCSQAIFLTSLIGAVLGnSIVFGQTFF 



Sequence description 


A] Length: 330 bp - 110 aa (partial sequence)

B] Putative GTG start codon is preceded by a
typical Shine-Dalgarno sequence. May have a
leader peptide



ID-152 



Clone 2-c92 



5 ism ^€ rtd\ nO 

TTGATTAACAAGTATTCGTGCTTTTTGAAGAGGATTCTCCATAATAATACT 
CCTTTAATAGTTATCGTGAGAAGTATTTTAAAGAAAAACCGCCAAGGTAG 
AGCGACATTTCTGCCTTTAACTACAATAAAACCAAGAGAATTAGCACAAC 
ATTATCTCTCAAAATTACAAAGTTCTCAAGGGTTTTTAGGAATAGCTAGTG 
AATTGGTAACCTATGATCAACGCTTGTCAAACATTTTT

(sea. -P£> fit'. 1 53} 

MINKYSCFLKRILHNNTPLIVIVRSILKKNRQGRATFLPLTTIKPRELAQHYLSK 
LQSSQGFLGIASELVTYDQRLSNIF 

Sequence description



A] Length: 240 bp - 80 aa (partial sequence) 

B] No obvious Shine-Dalgarno sequence precedes the putative TTG start
codon



ID- 153 


Clone 2-c94 
CSE&. 3^0 M . I5l) 

TTGTTGACTCACAAAAATATATTATTAACCATTATATTTGGATTATTTATGA
TTATATTATCAGCATGTGGTATGTCTAATAAGGAAATGGCTGGTATTGATA 
ATTGGGAACATTATCAAAAGGAAAAGAAAATTACTATTGGATTTGATAAT 
ACTTTTGTTCCTATGGGATTTGAAAGTCGTTCTGGTGACTATACCGGCTTTG 
ATATTGATTTAGCTAATGCTGTTTTTAAAGAATACGGTATTTCAGTGAAAT 

35 GGCAGCCTATTAACTGGGATATGAAAGAAACTGAACTTAATAATGGTAAT 
ATAGACCTTATTTGGAATGGTTATTCAAAAACGGCAGAACGTGCTAAAAA
AGTCGCTTTTACAAACCCATATATGAATAATCATCAAGTAATTGTTACTAA 
AACTTCATCACATATTAATAGTATTAAGGATATGAAGGGGAAAAAACTAG 
GAGCCGAGTCGGGTTGATCTGGTTTTGATGCTTTTAACGCTAAACCTGATA 

40 TTTTAAAAAAGTTTGTAAAAGGAAAAGAAGCAGTTCAATACGATACTTTC 
ACTCAGGCTTTGATTGATTTAAAAAATAACCGTATTGATGGTCTTTTGATT 
GATGAAGTTTATGCTAACTATTATTTAAAGCAAGAAGGAA 



MLTHKNILLTIIFGLFMIILSACGMSNKEMAGIDNWEHYQKEKKrriGFDNTFV 

PMGFESRSGDYTGFDIDLANAVFKEYGISVKWQPINWDMKETELNNGNIDLI 
WNGYSKTAERAKKVAFTNPYMNNHQVIVTKTSSHINSIKDMKGKKLGAQSG 
SSGFDAFNAKPDILKKFVKGKEAVQYDTFTQALIDLKNNRIDGLLIDEVYANY 
YLKQEG 



Sequence description 



ID- 154 



Clone 2-c100
iSBL zpv aW'. ISl) 

ATGAAAATTTGGAAAAAAATAACCTTAATGTTTTCTGCAATTATTTTAACA 

ACAGTAATTGCATTGGGAGTCTATGTTGCCTCAGCTTATAATTTTTCGACTA 

ATGAATTGTCTAAGACTTTT 

(sm -jo <m : iS4) 

MKIWKKITLMFSAIILTTVIALGVYVASAYNFSTNELSKTF 



Sequence description 




A] Length: 649 bp - 216 aa (partial sequence) 

B] TTG start codon is preceded by a possible 
typical Shine-Dalgarno sequence. Has a 
leader peptide 



A] Length: 123 bp - 41 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Has a 
typical leader peptide 



ID-155



Clone 2-c1



(SEQ ID NO: 155)

ATGAAAAAACAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 

ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 

GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTTAAATATATATCAG 

ATAAAAATAACTATTTAGATAATATAAAAGTTTATTACTTTTCTATAAGTA 

TTTCTAAAGATGTACAAGATAAAGTCAGTGAAACAACAACTTGTTCATATA 

GACTAGAAAAGCAAAAGAATCAAGAGTTCATTGGTAATTTTGAACATGAA 

GTTAGTGAATCTAGTCAATATTCAACCGAAGTTAAAAATCAAATACAGTAT 

CCAATCCAGTATAAAGATAATTCAATTCGTTTTACTGAAAAAACACCGTCA 

GAACGTTATGATGAGTTTGTTTTTAGTTCATTTGATTCTTCATTATTAAAAA 

AATATAAAATATATGATTACTTACTAAAACATCCCGAAACTGAATTAAAA 

GGTGTTTCCTATAAGATTCCTATAAATTCTGAAATTGTAGCCCCTTTTATAA 

ATCAATTAAATATAAAAAATCCTAAAAAATCATCTATTTCGGTTACAAAAA 

CGGAAAGTAAAGAATATTATTATACAATCAGTATTGATACTGATTCTGAGA 

TATATTCTATATTCGAAGGTATTCAT 

MKKQRLLLLFGGLLIM1MMTACKDSKIPENRTKKEYQAEQNFKSYFKYISDKN 

NYLDNIKVYYFSISISKDVQDKVSETTTCSYRLEKQKNQEFIGNFEHEVSESSQ 

YSTEVKNQIQYPIQYKDNSIRFTEKTPSERYDEFVFSSFDSSLLKKYKIYDYLLK 

HPETELKGVSYKIPINSEIVAPFINQLNIKNPKKSSISVTKTESKEYYYTISIDTDS 

EIYSIFEGEH 



Sequence description 



A] Length: 687 bp - 229 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Has a 
typical leader peptide. C-terminus has yet to be 
verified 



ID-156 



Clone 2-c5 

ATGACATTTGACACCATTGATCAATTAGCGGTTAATACAGTCCGCACGCTT 
TCTATTGATGCTATCCAAGCAGCAAATTCTGGGCACCCAGGTCTTCCTATG 
GGAGCTGCGCCTATGGCTTATGTGCTTTGGAATAAATTCTTAAATGTAAAC 
CCAAAAACAAGTCGCAATTGGACAAACCGTGACCGTTTTGTACTTTCAGCT 



GGGCATGGTTCAGCTCTTCTTTATAGCCTACTTCATTTAGCTGGCTATGATT 
TATCAATTGATGATTT 

MTFDTIDQLAVNTVRTLSIDAIQAANSGHPGLPMGAAPMAYVLWNKFLNVNP
KTSRNWTNRDRFVLSAGHGSALLYSLLHLAGYDLSIDD



Sequence description 



A] Length: 272 bp - 90 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. No obvious 
leader peptide 



ID- 157 


Clone 2-c8 
(SEL zcv M' i5f) 

ATGAGAACACTATTTAGAATGATATTTGCTATTCCAAAGTTTATCTTTAGA 
25 TTGATTTGGAATATCATTTGGGGAATATTCAAGACAGTTCTTGTTATTGCG 
ATTATTTTATTTGGCTTGTATTACTATGCGAATCACAGTCAATCAGAATTTG 
CTAATCAACTTAGTGACATTATTCAGACAGGAAAAACATTTTT 

(5ZCLZZO NO'. Hod) 

MRTLFRMIFAIPKFIFRLIWNHWGIFKTVLVIAIILFGLYYYANHSQSEFANQLS 
30 DIIQTGKTF 



Sequence description 



A] Length: 197 bp - 65 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 



ID-158 



Clone 2-c9 



ATGTCAAAAAAAATAATATTAGGAATTTTATCTCTTTTATCTGTCGTTACTT 
TGGTGGCGTGTGGTTCATCAGACAAACAGCTACAAGATAAAGTTGAGAAA 
AAAGGGAAGTTAGTTTTAGCGGTGAGTCCAGATTATGCTCCCTTTGAGTTT 



MSKKIILGILSLLSVVTLVACGSSDKQLQDKVEKKGKLVLAVSPDYAPFEF
Sequence description 



ID-159



Clone 2-c10

ATGAAAAATCAAAGACTATTACTGCTTTTTGGAGGCTTATTAATAATGATA 
ATGATGACAGCATGTAAGGATTCAAAAATCCCAGAAAACCGCACGAAAAA 
GGAATACCAGGCAGAACAGAATTTTAAGTCATACTTT

MKNQRLLLLFGGLLIMIMMTACKDSKIPENRTKKEYQAEQNFKSYF 



Sequence description







A] Length: 153 bp - 51 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide (not in frame with nuc) 






A] Length: 139 bp - 46 aa (partial sequence) 

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 



ID- 160 



Clone 2-c11


ATGATTGGAAAATTATATTATAGCTATAGAAAGTCACGCTTATTAAGAAGT 

ATTTTATGGCTTATTTTAATTGTTGGTGTATATATGTTAGGACAACGTGTTT 

TATTATCCACTGTTCCTTTATCACATCAAGAGATAAAACTAGCAGTAGATC 

AACATTTACTCAATAACTTTTCAGCAGTAAGTGGTGGGAGTTTTAATAAAT
TAAATGTTTTCACACTGGGGTTGAGTCCATGGATGTCAAGTATGATTATTT 
GGAGATTCGTTTCCTTATTTTCGTGGGCAAAAAATGCAACGAAGCGAAAA 
GCAGAAGTAGCTCAATATACTTTAATGCTTACTATCTCAGTTATACAAGCA 
TATGGTGTTTCAGGAAATCAATTTATAAAAAGCTCTTTATTAGGTTCTTATA 

GTGATATTGTTTT

MIGKLYYSYRKSRLLRSILWLILIVGVYMLGQRVLLSTVPLSHQEIKLAVDQHL 

LNNFSAVSGGSFNKLNVFTLGLSPWMSSMIIWRFVSLFSWAKNATKRKAEVA 

QYTLMLTISVIQAYGVSGNQFIKSSLLGSYSDIVF 


Sequence description 



A] Length: 423 bp - 141 aa (partial sequence)

B] ATG start codon is preceded by a potential 
typical Shine-Dalgarno sequence. Possesses a 
leader peptide 



ID-161 



Clone 2-c13

CSEJl XD MO', lliff) 

ATGAAAGGTCTATTGGATTTTTTAGTTAATATTGCCAGAACGCCAGCTATT
TTAGTCGCGTTGATAGCCATTATCGGTTTAGTACTGCAGAAAAAAGGTGTT 
40 CCTGATATTGTAAAAGGTGGAATAAAAACATTTGTTGGCTTCTTAGTGGTT 
TCTGAAGGTGCAGGGATAGTCCAAAATTCCTTGAATCCATTTGGAAAAATG 
TTTGAACATGCTTTTCATTTGGTGGGGGTAGTTCCTAATAATGAAGCCATT 
GTAGCAGTAGCTCTTACGAAGTATGGCTCAGCAACTGCTTTGATTATGTTA
GCGGGAATGATTTTTAATATTTTAATTGCTCGTTTTACAAAA 



(SE6L ZU) «sD\ lid) 

MKGLLDFLVNIARTPAELVALIAIIGLVLQKKGVPDIVKGGIKTFVGFLVVSEG 
AGIVQNSLNPFGKMFEHAFHLVGVVPNNEAIVAVALTKYGSATALIMLAGMI 
FNILIARFTK 



Sequence description 



ID- 162 



Clone 2-c21 
(S£8i 3V fJO) 

TTGGTTGGTAAGCCCCAATTACTATTTTTAGATGAACCTACTTCCGGAATG 

GATACTTCCACACGTCAACGATTTTGGAAGCTGGTTGCGACACTAAAAAA 

AGAAGGTGACACAATTGTCTATTCTAGTCATTATATCGAAGAGGTAGAAC 

ATACAGCTGATAGGATTTTAGTACTTCATAAAGGAAAGTTATTACGCGATA 

CAACCCCCTTTGCCATGAAGCAAGAAAAAACCGAAAAGTTATTCACCGTT 

CCGCTTAGTTATCAAAAATTATTACCTACCTATTTGATTACAGAGTGTGAA 

GCCAAGAGTGATAGTATAACGTTTGTTACTGGGGAGGCTGAAACTGTATG 

GAAAATACTGGCAGATAATGGTTGTCCTATTGAAGCTATTGAGATGACCA 

ATAGAACTTTGTTAAATCGTATTTTTGAGACTACTAAGGAGGTAAAACATG 

AGAATCTTTA 

(Sesl -xo m: &d) 

MVGKPQLLFLDEPTSGMDTSTRQRFWKLVATLKKEGDTr/YSSHYIEEVEHTA 

DRILVLHKGKLLRDTTPFAMKQEKTEKLFTVPLSYQKLLPTYLITECEAKSDSI 

TFVTGEAETVWKILADNGCPEAffiMTNRTLLNRIFETTKEVKHENL 



Sequence description 

A] Length: 462 bp - 155 aa (partial sequence) 

B] Putative TTG start codon is not preceded by
an obvious Shine-Dalgarno sequence. No obvious 
leader peptide. N- and C- termini require further 



A] Length: 348 bp - 116 aa (partial sequence)

B] ATG start codon is preceded by a potential 
Shine-Dalgarno sequence. Possible leader 
peptide 



examination 



ID- 163 



Clone 2-c25 

(SEQ ID NO: [illegible])

TTGAAAAAATCCAAGAGAAGCCGTAAGGCAGTGACAACAAGTGGTGAGA 

AGACTTTACTTGAGGATTTGGCAAAAATGAATTTCCTAGACGAAGTCATTA 

ATGTTATGGTTTTATATACCTTGAATAAGACAAAATCTGCTAACTTAAATA 

AGGCCTATATCATGAAAGTTGCTAATGATTTTGCCTTTCAGAATGTTATGA 

CGGCCGAAGATGCTGTGCTTAAAATTCGTGATTTTTCAGATCAAAAAGTAA 

GGACTAAAACAGAAACGAAGAAGAAACAATCGAATGTTCCTGAATGGAGT 

AATCCTGATTATAAAGATGAGGTTAGCCCAGAAAAAGAAATTGAATTAGA 

ACAGTTT 

(SEQ ID NO: [illegible])

MKKSKRSRKAVTTSGEKTLLEDLAKMNFLDEVINVMVLYTLNKTKSANLNK
AYIMKVANDFAFQNVMTAEDAVLKIRDFSDQKVRTKTETKKKQSNVPEWSN 
PDYKDEVSPEKEIELEQF 



Sequence description 



A] Length: 360 bp - 120 aa (partial sequence)

B] N- and C- termini require verification. 



ID- 164 



Clone 2-c28 
(SEQ ID NO: [illegible])

ATGACGAATCATATTACTAAACTGATAGAAAATAGCGGAAAAAAATTGAC 

AGAAATTAGCGAAGCTACAGATATAGCCTATCCTACACTTTCTGGATACAA 

TCAAGGAATCCGCAAACCTAAAAAAGATAATGCTGAAAAATTGGCAAAAT 

ACTTTAATGTTTCCGTCGCTTACATTATGGGACTTGATAGCAACCCACATG 

CTCCATCAAATCTT 

(SEQ ID NO: [illegible])

MTNHITKLIENSGKKLTEISEATDIAYPTLSGYNQGIRKPKKDNAEKLAKYFNV 
SVAYIMGLDSNPHAPSNL 



Sequence description 



A] Length:218 bp - 72 aa (partial sequence) 

B] ATG start codon is preceded by an 
obvious Shine Dalgarno sequence. No obvious 
leader peptide. 
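
Statements such as "ATG start codon is preceded by an obvious Shine Dalgarno sequence" can be illustrated with a simple motif scan. This is a rough sketch, not the method used to annotate these entries: the core motif `GGAGG`, the 4 to 12 nt spacing window, and the function name are all assumptions chosen for illustration. The test fragment is taken from the 3' end of the ID-162 nucleotide listing, where an AAGGAGG stretch happens to sit a few bases ahead of an ATG.

```python
import re


def find_sd_starts(dna, core="GGAGG", min_gap=4, max_gap=12,
                   starts=("ATG", "TTG", "GTG")):
    """List (motif_pos, start_pos, codon) for each candidate start codon
    lying min_gap..max_gap bases after a Shine-Dalgarno-like core motif.

    The motif and spacing window are illustrative assumptions.
    """
    dna = "".join(dna.split()).upper()
    hits = []
    for match in re.finditer(core, dna):
        for gap in range(min_gap, max_gap + 1):
            pos = match.end() + gap
            codon = dna[pos:pos + 3]
            if codon in starts:
                hits.append((match.start(), pos, codon))
    return hits


fragment = "ACTAAGGAGGTAAAACATGAGAATC"  # from the end of the ID-162 listing
print(find_sd_starts(fragment))
```

Because most listings here begin at the start codon itself, the upstream region needed for such a scan is generally not part of the printed sequence.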



ID- 165 



Clone 2-c29 

(SEQ ID NO: [illegible])

TTGATGAAAAGGAATAAACATTTACCGTTAACAGAAACTACCTATTATATT 

TTATTAGCTTTGTTTGAGGAAGCGCATGGCTATGCTATTATGAAAAAAGTT 

GAAGAAATGAGTGGCGGTGATGTTAGAATAGCCGCAGGGACAATGTACGG 

TGCCATTGAAAATTTACTTAAACAAAAATGGATAAAGTCTATCTCAAGTGA 

CGATAGAAGAAGAAAAGTTTATATTATTACTGAGACAGGAAAAGAAATAG 

TAGAACTTGAAACGAATCGATTAAGAAAGTTACTTAATACTGCTAATCAGT 

TGGGTTTTGGAGGAGATGGTTATGATAAAGTTT 

MMKRNKHLPLTETTYYILLALFEEAHGYAIMKKVEEMSGGDVRIAAGTMYG 
AIENLLKQKWIKSISSDDRRRKVYIITETGKEIVELETNRLRKLLNTANQLGFG
GDGYDKV 



Sequence description 



A] Length:337 bp - 112 aa (partial sequence) 

B] TTG start codon is preceded by an
obvious Shine-Dalgarno sequence. The actual start
codon may be the ATG that comes immediately after the
TTG. Potential leader peptide.
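
The relationship between a nucleotide listing and its peptide listing, including the treatment of a TTG or GTG initiator as methionine, can be sketched as follows. This is a minimal illustration using the standard genetic code; the function name and the rule of reading ATG, TTG or GTG in the first position as M are assumptions made for the example, not a description of the software used for these entries.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumed set of initiator codons read as methionine in first position.
START_CODONS = {"ATG", "TTG", "GTG"}


def translate(dna):
    """Translate a reading frame starting at position 0.

    Whitespace is ignored, an initiator codon in the first position is
    read as M, and unknown codons become X. Trailing bases that do not
    form a complete codon are dropped.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
        else:
            protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)


# First 21 bases of the ID-165 listing: TTG followed directly by ATG.
print(translate("TTGATGAAAAGGAATAAACAT"))
```

Translating from the TTG yields a peptide beginning MM, while starting at the following ATG would give the same peptide one residue shorter.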



ID- 166 



Clone 2-c35 

(SEQ ID NO: [illegible])

CCCATTACTGGTGAGTTAATAGCTGAGAAATTAGGAGTACCAAGAGCAGC
ACTAAGGTCTGATTTGCGGGTTTTAAGTATGCTAGGTATCATAGATGCAAA 
ACCTAAGGTTGGTTATTTTTATTTAGGACAGTATCATGCTTCAATAGGGAC 
AAGTCATTTTGAAAAGATGACAGTTTCAGAAATTATGGGGATCCTTCTGAC 
AGTTCATCAAAAAGATTCAGTTTATGATGTTATTGTACATATTTTTATGGA 

AGATGCTGGTTGTGCTTTTATCTTGGATGATGATGATTTTCTCTGTGGAGTC
GTGTCACGTAAAGATTTACTAAAAACCAGTATTGGCGGAGGAGATCTTTCT
AAAATGCCAATAGGAATGGTGATGACACGTATGCCACACGTGACAACTGT
TTTAGAAAATGAAAGTCTTTTTGCGGCAGCTGATAAATTAGTGAGCAGAA
AAGTGGATAGTCTCCCTGTCGTTCGTCATGATAAGCAATATCCCGAAAAAT

TTA

(SEQ ID NO: [illegible])

PITGELIAEKLGVPRAALRSDLRVLSMLGIIDAKPKVGYFYLGQYHASIGTSHF
EKMTVSEIMGILLTVHQKDSVYDVIVHIFMEDAGCAFILDDDDFLCGVVSRKD
LLKTSIGGGDLSKMPIGMVMTRMPHVTTVLENESLFAAADKLVSRKVDSLPV
VRHDKQYPEKF



Sequence description 

A] Length: 511 bp - 170 aa (partial sequence)

B] N- and C-termini to be determined

ID- 167 



Clone 2-44 


TTGGAAGTCATCATGCAATTTATTTATAGTATTATTGGTATTTTATTGGTAT 
TAGGAATTGTGTATGCAATTTCTTTCAATCGTAAGAGTGTTTCTCTAAGTTT 
AATTGGAAAAGCTCTTATCGTTCAATTCATTATTGCGCTAATCTTAGTACGT
ATCCCACTAGGCCAACAAGTTGTTAGTGTTGTTTCAACTGGAGTTACTAAA
GTAATCAACTGTGGTCAAGCTGGTTT

(SEQ ID NO: [illegible])

MEVIMQFIYSIIGILLVLGIVYAISFNRKSVSLSLIGKALIVQFIIALILVRIPLGQQ 
VVSVVSTGVTKVINCGQAG 



Sequence description 






A] Length: 233 bp - 77 aa (partial sequence)

B] TTG start codon is preceded by a 
possible Shine Dalgarno sequence. Actual start 
codon may occur further downstream. Potential 
leader peptide. 



ID- 168 



Clone 2-46 

CAACCTAATAAAGCTTTAGAAAGTGATGAGATTGATATTAATGCTTTCCAG 

CATTATAATTACTTAACCAATTGGAATAAAGCAAATAAGACCAATCTTGTT 

TCCGTTGCTGAGACATACTTTACTTCCTTTAGATTATACTCTGGTACTAAGA 

ACGGTAAAGGTAAATACCAAACAGTTTCTGAAATTCCAAATAAAGCAACT 

ATTACTATCCCAAACGATGCAGTTAACGAAAGTCGCTCTCTCTACTTGTTA 

CAATCAGCAGGCTTGCTAAAATTGAAAGTATCAGGTGATACATTAGCAAC 

AATGTCAGATGTTGTTTCCAATCCTAAATCTTTAGATTT 

QPNKALESDEIDINAFQHYNYLTNWNKANKTNLVSVAETYFTSFRLYSGTKN 
GKGKYQTVSEIPNKATITIPNDAVNESRSLYLLQSAGLLKLKVSGDTLATMSD
VVSNPKSLD 



Sequence description 



A] Length: 344 bp - 114 aa (partial sequence)

B] N- and C- termini require verification 



ID-169



Clone 2-47 

(SEQ ID NO: [illegible])

ATGAAATGTATAATAAATAATATAAATAAAATAAAAATGATAATTGAGAT 
TTATCATAGAAGGAAAACTATTTTGAAATTAAATAAAATCATATTATCTAC 



TGCAGCTCTTACTGCTCTCTTTTTAGGATATAATAGCGTTACTGCGGATACA 
TATAATAACTATCAGCCACATAGATCAAATAATATGGATTTAACTGAGGA 
ATATAACTATAATAACCAGATAGAACTTCAGGAGCGTATAAAAAACCTAA 
ATATACCTTTT 



MKCIINNINKIKMIIEIYHRRKTILKLNKIILSTAALTALFLGYNSVTADTYNNY
QPHRSNNMDLTEEYNYNNQIELQERIKNLNIPF 






Sequence description 






A] Length:264 bp - 88 aa (partial sequence) 

B] There is a Shine-Dalgarno sequence upstream 
of this sequence. Potential leader peptide 
sequence 






ID-170



Clone RS-58b

(SEQ ID NO: [illegible])

TTGGGTGATTATTATGGTAAGAAATATTTTGGTGAGGCAGCTAAAAAAGA 
CGTCGAACATATGGCTAAGAAAATCATTAATGTCTATAAAACACGGTTAA 

AAAACAACACTTGGTTATC

AGAAAATACAAAAGCAATGGCCATTAAGAAACTTGATAACATGAGATTAA 
TGATTGGCTATCCAGAAGATTATCCTGATCTTTATCGTCAGTACCAATTTG 
ATAGTAAAGCAAGCTTCTTTGAAAACAATGATAACTACAGAAAATTATCG 
AACAAGAAAACATTTGAAGAATTTAACCAGTCTAATCAACGTGAACATTG 

GCAAATGAGTGCCAATGCTGTAAATGCTTATAATGATCCTAATACCAATTC
CATAGTCTTTCCAGCAGCGATTTTTCAATCACCACTGTACGATAAAACTAA 
AACAGTTAGTCAAAATTATGGAGCTATCGGAGCAATTATTGGTCATGAAAT 
TTCACACTCATTTGATATTAATGGTATGAAATATGACGAGAAAGGGAATCT 
TCACGATTGGTGGACTAAAGAAGATTTAAATCATTATAAGAAATCAACAC 

AAGCTATGATTGACCAATGGGATGGCCTTAAAGCAGATGGCGGTAAAGTT
GATGGTAAATTAACTTTAGCAGAAAATATTGCAGATAATGGTGGTGTTATG 
GCATCTCTAGAAGCTCTTAAGACTGAAAAAATCCAAACTATAAAGAATTTT 
TTGAATCATGGGCAAGTATTTGGCGTCAAAAAGCAACCAAAGAACAAAGT 
AAGTCCTCAATTCAGTCAGATGTTCATGCACCATATGAATTGA-> 

GAGCTAACATCCCAGTACGTAATTTCCAAGAATTTTATGATGCCTTTGGTG
TTAAAAAAGGCGATTCAATGTATCTAAAACCAGAAAAACGTTTGACACTTT 
GGTAA 

(SEQ ID NO: [illegible])

MGDYYGKKYFGEAAKKDVEHMAKKIINVYKTRLKNNTWLSENTKAMAIKK
LDNMRLMIGYPDYPDLYRQYQFDSKASFFENNDNYRKLSNKKTFEEFNQSNQ
REHWQMSANAVNAYNDPNTNSIVFPAAIFQSPLYDKTKTVSQNYGAIGAIIGH
EISHSFDINGMKYDEKGNLHDWWTKEDLNHYKKSTQAMIDQWDGLKADGG
KVDGKLTLAENIADNGGVMASLEALKTEKIQTIKNFLNHGQVFGVKKQPKNK

VSPQFSQMFMHHMN*

Sequence description: 

A] Length: 819 bp - 272 aa (full length gene)
(107 bp of additional DNA sequence (> onwards) is
also included. While not in-frame with the
described orf, it also shares strong homology
with the neutral peptidases.)



B] This gene sequence was not identified using the LEEP system. It was identified
downstream of the ID-89 gene which was identified by LEEP, during cloning and
sequence analysis of the full-length ID-89 gene sequence. ID-89 and ID-170 together
show homology over their combined entire length with the neutral endopeptidases
from Lactococcus and Lactobacillus. Possesses a TTG start codon (a possible ATG
start codon is located 13 bp further downstream) with no obvious signal peptide.
A Shine-Dalgarno sequence is not immediately obvious and is possibly located
further downstream



ID-171



Clone 2-18/22b(Mod2)

ATGACCATGATTACGCCAAGCTTCATTAAGGTATCTCTAGATGAAACAAAT 
CGTATGATGCGTATGATATCAGATTTATTAAGTTTATCGCGCATTGATAAT 
GAAGTAACGCATTTAGATGTTGAAATGACGAATTTTACAGCTTTCATGACC 
TCAATTTTGAATCGATTTGATCAGATTAGAAATCAAAAAACAGTCACAGG 

AAAAGTTTATGAAATTGTCAGAGATTATCCTCTTAAGTCAATTTGGGTGGA
AATTGATACAGATAAGATGACTCAAGTGATTGATAACATTTTAAATAATGC 
AGTCAAGTATTCACCAGATGGTGGTAAGATTACAGTTAATCTACGCACAAC 
TAAAACGCAGATGATTTTATCAATATCAGACCAAGGCTTAGGTATTCCCAA 
AAAAGATTTACCTCTCATTTTTGATCGTTTTTATCGTGTTGATAAGGCGAGA 

AGTCGTCAACAGGGTGGGACTGGACTTGGTTTGTCAATTGCAAAAGAAAT
TGTTAAGCAGCATAAGGGATTTATTTGGGCTAAGAGTGAGTATGGTAAAG 
GGTCTACTTTTACAATCGTCTTGCCTTATGATAAAGATGCTGTAACTTATGA 
AGAATGGGAGGACGTTGAAGATTAA 

(SEQ ID NO: [illegible])

MTMITPSFIKVSLDETNRMMRMISDLLSLSRIDNEVTHLDVEMTNFTAFMTSIL

NRFDQIRNQKTVTGKVYEIVRDYPLKSIWVEIDTDKMTQVIDNILNNAVKYSP

DGGKITVNLRTTKTQMILSISDQGLGIPKKDLPLIFDRFYRVDKARSRQQGGTG

LGLSIAKEIVKQHKGFIWAKSEYGKGSTFTIVLPYDKDAVTYEEWEDVED*



Sequence description: 



A] Length: 613 bp - 212 aa (full-length gene possibly)

B] Possible Shine-Dalgarno sequence present
upstream of an ATG start codon. The N-terminal portion of this
gene may not yet have been determined. No
obvious signal peptide.



ID-172

Clone 2-54balternate (107b) 

TTGAAAAAAATTATTACTTCTATTCTATTACTTAGTTGCATTTTTTTTATGC 

CAACCATCTCTGCTGAATCTTTTAATGCTTCCGCTAAACATGCCTTAGCAGT
TGATTTAGATTCAGGAAAAATCTTGTATGAAAAAGATGCTAACAAACCCG 
CTGCTATTGCTTCCTTGACTAAAATAATGACCGTTTATATGGTCTATAAAG 
AAATTGATAACGGTAACCTCAAGTGGAATACCAAAGTAAATATATCTGAC 
TACCCTTATCAACTAACACGCGAATCTGATGCTAGTAATGTTCCTTTAGAA 

AAAAGGCGCTATACTGTTAAACAACTCGTGGACGCTGCCATGATTTCTAGT
GCTAACAGTGCAGCCATTGCTTTAGCTGAACATATTTCAGGAACTGAAAGT 
AAATTTGTTGATAAAATGACTGCTCAATTGGAAAAGTGGGGAATTCATGAT 
AGCCACCTAGTCAATGCTTCTGGCTTAAATAATAGTATGTTAGGCAATCAC 
ATTTATCCAAAATCGTCACAAAACGACGAAAATAAAATGAGTGCACGTGA 

TATTGCTATTGCTGCCTACCATTTGGTCAACGAATATCCTTCCATTCTTAAG
ATTACTAGTAAGTCCGTTGCTAAATTTGATAAAGATATTATGCATTCTTAT 
AACTACATGCTACCAGATATGCCTGTCTTTAGACCAGGTATTACAGGTTTG 
AAAACTGGGACAACGGAATTAGCTGGCCAATCTTTTATTGCTACATCTACT 
GAAAGTGGAATGAGACTACTCACTGTTATTATGCATGCTGATAAGGCCGAT 

AAAGACAAATATGCTCGCTTTACAGCAACTAACTCTCTCTTGAACTATATC
ACAAACACCTACGAACCTAACCTTGTATTAGCTAAAGGAGCTGCATATAA 
AGGTAAAGAAGCAAGTGTGAGAGACGGAAAAGAACAATCGGTCATCGCT 
GTTGCTAAAAACGATTTGAAAGTAGTACAGAAGAAAAATATCACTAAACA 
AAATCAGTTAAAAATTAACTTTAAAAAAGAGCTTACTGCTCCTATTACAAA 

AAAAGAGAACCTAGGGAAAGCTTATTACGTTGACCTTAATAAGGTTGGAA
AAGGCTATCTCATAAAGGAACCTAGCGTTCATTTAGTGGCAAAAGATAGT 
ATTGAGCGCAGTTTCTTCCTCAAAGTGTGGTGGAATCATTTTGTGCGCTAC 
GTTAACGAAAAACTTTAA 



(SEQ ID NO: [illegible])

MKKIITSILLLSCIFFMPTISAESFNASAKHALAVDLDSGKILYEKDANKPAAIA

SLTKIMTVYMVYKEIDNGNLKWNTKVNISDYPYQLTRESDASNVPLEKRRYT 

VKQLVDAAMISSANSAAIALAEHISGTESKFVDKMTAQLEKWGIHDSHLVNA 

SGLNNSMLGNHIYPKSSQNDENKMSARDIAIAAYHLVNEYPSILKITSKSVAKF 

DKDIMHSYNYMLPDMPVFRPGITGLKTGTTELAGQSFIATSTESGMRLLTVIM

HADKADKDKYARFTATNSLLNYITNTYEPNLVLAKGAAYKGKEASVRDGKE 

QSVIAVAKNDLKVVQKKNITKQNQLKINFKKELTAPITKKENLGKAYYVDLN 

KVGKGYLIKEPSVHLVAKDSIERSFFLKVWWNHFVRYVNEKL* 



Sequence description: 



A] Length: 1236 bp - 412 aa (full-length gene sequence possibly) 

B] A possible Shine-Dalgarno sequence precedes the putative 'TTG' start 
codon. (needs further cloning and sequencing to verify N-terminus) 



ID-173



Clone 3-60b

ATGACGCTTCGAGAATTAACAATAGAAGAATTTAAAGAACATTCAGGAAA
TTATGATTCACAATCATTTTTACAAACACCTGAGATGGCTAAACTTTTAGA
AAAACGCGGCTATGATGTTAGGTATTTGGGATATCAAGTAGAAAATAAAC
TAGAGATAATCAGTTTATCTTATATTATGCCAGTCACTGGTGGTTTTCAAAT
GAAAATTGATTCAGGACCAGTTCATTCAAATTCTAAGTATCTAAAACAATT
TTATAAAGCATTGCAAGGCTATGCCAAATCCAACGGTGTTCTAGAATTAAT
AGTTGAGCCTTTTGATGATTACCAATTATTCACTAGTTCGGGAGTTCCTAGT
AATCAGGGAAATGATAATCTGATTGAAGATTTTACCAGTTCAGGTTATCAC
CATGATGGTTTAACAACTGGTTTTACTGGTAAATATTTATCTTGGCACTATG
TTAAAAATTTAGAAGGTGTCACTTCTGAAACGTTACTATCTTCATTCTCTAA
GACAGGACGAGCTTTGGTTAAGAAAGCAATGTCTTTTGGAATCAAGGTTC
GCGTTCTTAAACGTGATGAGCTACATTTATTTAAAGAGATAACAACTTCTA
CGTCAAATAGACGTGATTATATGGATAAGTCCTTAGATTATTATCAAGATT
TTTACGATAGCTTTGAAGGCAAGGCTGAATTTGTGATTGCCACTTTAAATT
TTAGAGAATACGACCATAACTTGCAAATAAAAGCTGAAGCATTGGAAAAT
AAGCTT



(SEQ ID NO: [illegible])

MTLRELTIEEFKEHSGNYDSQSFLQTPEMAKLLEKRGYDVRYLGYQVENKLEI
ISLSYIMPVTGGFQMKIDSGPVHSNSKYLKQFYKALQGYAKSNGVLELIVEPF
DDYQLFTSSGVPSNQGNDNLIEDFTSSGYHHDGLTTGFTGKYLSWHYVKNLE
GVTSETLLSSFSKTGRALVKKAMSFGIKVRVLKRDELHLFKEITTSTSNRRDY
MDKSLDYYQDFYDSFEGKAEFVIATLNFREYDHNLQIKAEALENKL



Sequence description

A) Length: 771 bp - 257 aa (partial gene sequence)

B) This gene sequence was not identified using the LEEP system. It was
identified immediately downstream of the ID-65 gene which was identified by
LEEP, during cloning and sequence analysis of the full-length ID-65 gene
sequence. Sequence Characteristics:
No obvious leader peptide sequence
Orf is preceded by a potential Shine-
Dalgarno sequence.



ID-174

Clone 2-17b (ID-80b)
(SEQ ID NO: [illegible])

TTGTCATTAAGTTTGGTTGCAGTGTTAAATCTTATCCCTCCTAAAATCATGG 

GATCAGTTATTGATGCTATTACAACTGGAAAATTAACAAGACCACAATTAC 

TATGGAATTTATTAGGTTTGGTTTTGTCAGCTTTAGCTATGTATGGGCTGCG 

TTATATTTGGCGTATGTATATTTTAGGGACTTCTTACAAATTAGGCCAAGTT 

GTCAGATACCGTTTATTTGAACATTTTACAAAAATGTCTCCTTCTTTTTATC 

AGAAATATCGTACAGGTGATTTAATGGCGCACGCGACCAACGACATCAAT 

TCTCTAACACGTCTTGCAGGAGGAGGAGTTATGTCAGCAGTGGATGCCTCT 

ATCACAGCATTAGTAACGCTTATCACCATGTTCTTTACTATTTCGTGGCAA 

ATGACATTAATTGCGGTTATCCCTTTGCCCTTAATGGCCTTAGCACTAGTA 

AATTGGGGCGAAAAACCCATGAAACCTTCAAAGAATCTCAGGCAGCCCTT 

TTCAGAATTAAATAATAAAGTG 

MSLSLVAVLNLIPPKIMGSVIDAITTGKLTRPQLLWNLLGLVLSALAMYGLRYI
WRMYILGTSYKLGQVVRYRLFEHFTKMSPSFYQKYRTGDLMAHATNDINSLT
RLAGGGVMSAVDASITALVTLITMFFTISWQMTLIAVIPLPLMALALVNWGEK
PMKPSKNLRQPFSELNNKV 



Sequence description 

A) Length: 534 bp - 178 aa (partial gene 
sequence) 

B) This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-80 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-80 gene sequence. 
Sequence Characteristics: 

No obvious leader peptide sequence 
Orf is preceded by a potential Shine- 
Dalgarno sequence. 



ID-175 



Clone 2-11Ab (ID-103b)

(SEQ ID NO: [illegible])

ATGCATATTGAGACTGTTATTGATTTCAAAGAATTAGGAAAAAGATATCGT 
TTTAAAAATCCTACAAAAGAATTAATAGCTGATACTTTAGAACAAGTCTTA 

GAAGTGATAAAAGAAGTTGATTATTATCAATCTCAAAATTATTATGTTGTT
GGTTATTTATCTTATGAAGCATCTGCTGCTTTTGATTCACATTTTAAAGTTT 
CTCAACAGAAGTTGGCTGGAGAACATCTAGCTTATTTTACAGTACATAAAG 
ATTGTGAGAACGAAGCTTTTCCTTTAAGTTATGAAAATGTTAGATTAGCAG 
ATAATTGGACTGCTAATGTTTCTGAGCAAGAATATCAAGAGGCAATTGCTA 

ATATTAAAGGACAAATTAGACAAGGAAATACTTATCAAGTAAATTATACA
CTAGAGCTTAGCCAACAATTATGCTCGGATCC 

MHIETVIDFKELGKRYRFKNPTKELIADTLEQVLEVIKEVDYYQSQNYYVVGY 
LSYEASAAFDSHFKVSQQKLAGEHLAYFTVHKDCENEAFPLSYENVRLADNW
TANVSEQEYQEAIANIKGQIRQGNTYQVNYTLELSQQLCSD 



Sequence description: 

A] Length: 440 bp - 146 aa (partial gene sequence)

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-103 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-103 gene sequence.
Shine-Dalgarno sequence present upstream of
ATG start codon. No apparent leader peptide sequence

ID-176

Clone 2-18/22b(b) (ID-104b)

GTGAATAATATGTTTTATCTCAAAATAGCCTGGCATAATTTAAAACATTCT
ATAGACCAGTACATACCATTCCTCTTAGCCAGTTTATTACTTTATTCATTGA 
CTTGTTCTACGCTACTAATCTTAATGAGTGCTGTTGGAAGAGATATGGGGA 
CAGCGGCAACGGTTCTTTTTCTTGGAGTGATTGTTTTGTCAATCTTTGCGGT 
AGTCATGGAACATTATAGCTACAATATCTTGATGAAACAGCGTAGTAGTG 



AATTTGGACTGTATAACATTTTGGGGATGAATAAACGTCAAGTTGCGCGTG 
TAGCTAGTCTAGAGCTGTTTATTATTTATATATTTCTTATTTCTATAGGAAG 
TCTGTTTAGTGCTTTTTTTGCTAAATTTATTTATTTAATTTTTGTCAACATTA 
TTAACTATCATGCACTAAATCTTAGTTTAAGTTTATGGCCATTTATTATTTG 
TATCGTTATATTTACAGGTATTTTTCTGACTTTAGAAGTTCCAGTTATTCGA
CATGTTCATTTATCATCCCCATTAAGTCTTTTTAGAAAGAAACAACAGGGA 
GAAAAAGAACCAAAAGGTAATCTTATACTTGCAATTTTAGCGTTAGTAGCT 
ATCGCCATCGCTTATACAATGGCTCTTACTTCAGGTAAAGCACCTGCATTA 
GCTGTTATCTATCGTTTCTTCTTTGCAGTACTTTTAGTAATTGCTGGTACTT 

ATCTTTTTTATATTAGTTTTATGACATGGTACTTAAAAAGGTTGCGTCAAAA
CAAGCATTATTATTATAAATCTGAGCATTTTGTATCAACTTCGCAAATGAT 
TTTTCGAATGAAGCAAAATGCAGTAGGGTTAGCAAGTATCACTTTATTAGC 
TGTTATGGCTCTAGTTACTATTGCTACAACAGTCTCACTCTATTCAAATACA 
CAAAATGTTGTTACCGGACTATTTCCAAAATCAGTAAGTTTATCAATAGAT 

AATTCAAAAGGTGACGCGAAAAATATATTTGAAGAAAAGATTTTGAAGAA
ACTAGGTAAGTCATCTAAGGAAGCTATCACTTATAATCAGACAATGATTTC 
GATGCCAGTTAGTCAATCAAGTGACTTAATATCACATCTA 



(SEQ ID NO: [illegible])

MNNMFYLKIAWHNLKHSIDQYIPFLLASLLLYSLTCSTLLILMSAVGRDMGTA

ATVLFLGVIVLSIFAVVMEHYSYNILMKQRSSEFGLYNILGMNKRQVARVASL

ELFIIYIFLISIGSLFSAFFAKFIYLIFVNIINYHALNLSLSLWPFIICIVIFTGIFLTLE

VPVIRHVHLSSPLSLFRKKQQGEKEPKGNLILAILALVAIAIAYTMALTSGKAP 

ALAVIYRFFFAVLLVIAGTYLFYISFMTWYLKRLRQNKHYYYKSEHFVSTSQM

IFRMKQNAVGLASITLLAVMALVTIATTVSLYSNTQNVVTGLFPKSVSLSIDNS 

KGDAKNIFEEKILKKLGKSSKEAITYNQTMISMPVSQSSDLISHL 



Sequence description: 

A] Length: 1119 bp - 373 aa (partial gene sequence)
B] This gene sequence was not identified using the LEEP system. It was
identified upstream of the ID-104 gene which was identified by LEEP, during
cloning and sequence analysis of the full-length ID-104 gene sequence.
Possible Shine-Dalgarno sequence present
upstream of a GTG start codon. Possesses a potential
leader peptide sequence



ID-177 



Clone 2-5b (ID-112b)

ATGGTTGAGCCAATTATTTCAATACAAGGACTTCATAAAAGTTTTGGGAAA
AATGAGGTTTTAAAAGGCATTGACTTGGATATTCATCAAGGAGAAGTGGT 
GGTTATTATTGGCCCTTCTGGCTCTGGTAAGTCAACATTTTTAAGAACAAT 
GAATCTCTTGGAAGTACCAACAAAGGGAACAGTGACTTTTGAAGGGATTG 
ATATAACAGACAAAAAGAATGATATTTTTAAAATGCGCGAAAAAATGGGC 

ATGGTTTTTCAACAGTTCAATCTATTTCCCAATATGACTGTACTAGAAAAT
ATTACTTTATCACCTATTAAGACAAAGGGACTTTCTAAGCTTGATGCTCAG 
ACAAAAGCATACGAGCTACTTGAAAAAGTTGGACTCAAAGAGAAGGCTAA 
TGCTTATCCAGCAAGCTTATCTGGAGGACAACAACAACGGATTGCTATTGC 
AAGAGGTCTTGCAATGAATCCTGATGTCCTTCTTTTTGATGAACCTACTTCA 

GCTCTTGATCCTGAAATGGTAGGTGAAGTCTTGACTGTTATGCAAGATTTA
GCTAAATCTGGTATGACGATGGTTATTGTCACTCATGAAATGGGTTTTGCA 
CGTGAAGTAGCGGATCGTGTCATTTTTATGGATGCAGGGATTATTGTTGAG
CAAGGGACCCCTAAGAAAGTATTTGAGCAGACAAAAGAAATCCGCACAAG 
AGACTTCTTAAGTAAAGTATTATAA 

MVEPIISIQGLHKSFGKNEVLKGIDLDIHQGEVVVIIGPSGSGKSTFLRTMNLLE
VPTKGTVTFEGIDITDKKNDIFKMREKMGMVFQQFNLFPNMTVLENITLSPIKT
KGLSKLDAQTKAYELLEKVGLKEKANAYPASLSGGQQQRIAIARGLAMNPDV
LLFDEPTSALDPEMVGEVLTVMQDLAKSGMTMVIVTHEMGFAREVADRVIF
MDAGIIVEQGTPKKVFEQTKEIRTRDFLSKVL* 



Sequence description:

A] Length: 735 bp - 244 aa (full length gene)

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-112 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-112 gene sequence.
Shine-Dalgarno sequence precedes the 'ATG'
start codon. No obvious leader peptide



ID-178



Clone 2-5c (ID-112c)



ATGTCTCAsTATCAAGAGTGGTTAGAAAACGACTCACTCGGTAAAGATATT 
AAGTCAGATTTAGAAGCTATTAAAGGAGATGAATCTGAAATTCAGGATCG 
TTTTTACAAAACATTAGAATTTGGAACGGCGGGATTGAGAGGTAAACTTG 
GAGCAGGAACCAATCGTATGAATACTTATATGGTGGGGAAAGCAGCACAA 
GCATTAGCTAATCGATTATTGATCATGGCCCTGAAGCTATTGCACGTGGAA
TTGCAGTTAGTTATGATGTCCCGTTATCAATCTAAGGAATTTGCAGAATTA 
ACTTGGTCCATTATGGCAGCAAATGGTATTAAAGCCTTATATTTA 

(SEQ ID NO: [illegible])

MSHMNYKEIYQEWLENDSLGKDIKSDLEAIKGDESEIQDRFYKTLEFGTAGLR

GKLGAGTNRMNTYMVGKAAQALANRLLIMALKLLHVELQLVMMSRYQSKE

FAELTWSIMAANGIKALYL



Sequence description: 

A] Length: 366 bp - 122 aa (partial gene sequence)

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-112 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-112 gene sequence.
Shine-Dalgarno sequence precedes the 'ATG'
start codon. No obvious potential leader peptide sequence.

ID-179

Clone 2-5d (ID-112d)

(SEQ ID NO: [illegible])

ATGCAACCTGTAAAAGTCGATGAACCTTCTGTTGAAGAAACCATTACTATT 
TTGAAAGGTATCCAAAAAAAATACGAAGATTATCATCACGTAAAATATAA 

TAATGATGCCATAGAAGCAGCTGCAGTACTATCTAATCGTTATATCCAAGA
CCGCTTTTTACCTGATAAAGCAATAGACTTATTAGATGAAGCTGGTTCTAA 
AATGAACCTAACACTAAATTTTGTTGATCCAAAAGAAATTGATCAACGTCT 
CATTGAAGCAGAAAATTTAAAAGCGCAAGCGACTCGTGAAGAAGATTACG 
AACGTGCAGCTTACTTCCGTGACCAGATTGCAAAATATAAAGAAATGCAG 

CAACAAAAGGTCGACGATCAAGATACACCTATTATTACCGAAAAAACAAT
TGAGCACATCATTGAAGAAAAAACGAATATCCCTGTTGGTGATTTAAAAG 
AAAAAGAACAATCTCAATTAATTAATCTCGCAGATGACTTGAAACAGCAT 
GTGATCGGCCAGGATGACGCTGTCATTAAGATTGCAAAAGCTATTCGTCGT 
AATGGAGTTGGTCTTGGTAGCCGAAACCGTCCTATTGGTTCCTTTTTATTTG

TAGGACCAACCGGTGTTGGTAAAACTGAACTTTCTAAACAACTAGCAATTG
AGCTCTTTGGTTCAGCTGATAGTATGATTCGTTTTGATATGTCAGAGTACAT 
GGAAAAGCATGCTGTTGCTAAATTAGTCGGAGCGCCTCCAGGATACGTGG 
GATACGAGGAAGCTGGACAACTAACTGAAAAGGTTCGTCGAAATCCTTAC 
TCGCTCATCCTTCTAGATGAAATTGAAAAAGCTCATCCCGATGTCATGCAT 



ATGTTCTTGCAGGTCCTTGATGACGGTCGATTAACAGATGGACAAGGAAG 
AACTGTTAGTTTTAAAGATACCATTATCATCATGACCTCAAATGCTGGTTC 
TGGTAAAACTGAAGCAAGTGTTGGCTTTGGTGCCTCACGAGAAGGTAGGA 
CGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCAT 
GCAAGC

MQPVKVDEPSVEETITILKGIQKKYEDYHHVKYNNDAIEAAAVLSNRYIQDRF
LPDKAIDLLDEAGSKMNLTLNFVDPKEIDQRLIEAENLKAQATREEDYERAAY
FRDQIAKYKEMQQQKVDDQDTPIITEKTIEHIIEEKTNIPVGDLKEKEQSQLINL
ADDLKQHVIGQDDAVIKIAKAIRRNRVGLGSPNRPIGSFLFVGPTGVGKTELSK
QLAIELFGSADSMIRFDMSEYMEKHAVAKLVGAPPGYVGYEEAGQLTEKVRR
NPYSLILLDEIEKAHPDVMHMFLQVLDDGRLTDGQGRTVSFKDTIIIMTSNAGS
GKTEASVGFGASREGRTNSSSVPGDPLESTCRHAS



Sequence description:

A] Length: 1070 bp - 356 aa (Partial gene sequence)

B] This gene sequence was not identified using the LEEP system. It was
identified upstream of the ID-112 gene which was identified by LEEP, during
cloning and sequence analysis of the full-length ID-112 gene sequence.
Shine-Dalgarno sequence precedes the 'ATG'
start codon. No obvious potential leader peptide
sequence.

ID-180

Clone 2-7b (ID-113b)

(SEQ ID NO: [illegible])

ATGAGAGGGAAGGTTATTTACGGCACAACCCTTATAGGTCTTTTTCTATTC
TTATTTTTCTATTTTTGGATTCCTAAGCATCACATCGAGAGAATACATCATC
ATCGTATAAAGCAGGTAGATGCGAAGAGTGATTTAACAGGATTTAAAACC 
CATTTGCCCATTATCAGCATTGATACAAAGCAACAAGTTATTCCTCTTGTT 
ACAAAAGAAGGCGGAAAATATGTCAAAGCTAGGGATAATATTAATGTTGA 
TATCGAATTACGGGATTCTCCAAGTAGATCACATCATTTATCAGAAAAGCC 
GAGAATTAGGACAAAAGGGTTAATATCATATAGAGGAAATTCCTCTCGTT
ACTTTGATAAGAAGTCATTGAAAGTTAAGTTTGTTACTAATAAGTTAAAGG 
AAAAGAAGCATCGATTAGCAGGAATGCCTAAAGAATCGGAGTGGGTATTG 
CATGGTCCCTTTCTAGACAGAACATTATTAAGAAATTATCTGAGTTATAAT 



ATTGCTGGTGAGATTATGCCTATGCCCCAAACGTTCGCTACTGTGAGTTAT 
TTGTCAATGGTGAGTATCAGGGAG 

(SEQ ID NO: [illegible])

MRGKVIYGTTLIGLFLFLFFYFWIPKHHIERIHHHRIKQVDAKSDLTGFKTHLPII
SIDTKQQVIPLVTKEGGKYVKARDNINVDIELRDSPSRSHHLSEKPRIRTKGLIS 
YRGNSSRYFDKKSLKVKFVTNKLKEKKHRLAGMPKESEWVLHGPFLDRTLLR 
NYLSYNIAGEIMPMPQTFATVSYLSMVSIRE 



Sequence description: 

A] Length: 582 bp - 194 aa (Partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-113 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-113 gene sequence.
ATG start codon is preceded by a Shine-
Dalgarno sequence. Possesses a potential leader peptide
sequence. C-terminus to be determined.



ID-181 



Clone 2-17b (ID-117b)

(SEQ ID NO: [illegible])

CTTCACATTTTATTGATCACTATCTGACAAATGTTAATCAAACAGCAGTTCT 

TATTTTAGTGGGATATTATTCAATGTATGTCTTGCAGACCTTAATTCAATAT 

TTTGGGAATCTCTTTTTTGCGCGTGTTTCTTATAGTATTGTTAGAGATATTC 

GTAGAGATGCTTTTGCTAATATGGAAAGGCTAGGCATGTCTTATTTTGATA 

GGACACCGGCAGGATCTATTGTGTCACGTATTACTAATGATACTGAAGCAA 

TATCTGATATGTTTTCGGGTATTTTATCAAGTTTTATCTCGGCGATATTTAT 

TTTTACAGTTACTCTGTACACTATGTTGATGCTAGACATTAAACTAACAGG 

ACTCGTCGCTCTTTTGTTACCTGTTATCTTTATATTAGTGAATGTCTATCGG 

AAAAAATCAGTCACTGTCATTGCTAAAACGAGAAGTTTACTTAGTGATATC 

AACAGTAAATTATCAGAAAGTATTGAAGGAATTC 

SHFIDHYLTNVNQTAVLILVGYYSMYVLQTLIQYFGNLFFARVSYSIVRDIRRD
AFANMERLGMSYFDRTPAGSIVSRITNDTEAISDMFSGILSSFISAIFIFTVTLYT
MLMLDIKLTGLVALLLPVIFILVNVYRKKSVTVIAKTRSLLSDINSKLSESIEGI 



Sequence description: 



A] Length: 498 bp - 165 aa (Partial gene sequence)
B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-117 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-117 gene sequence.
N- and C-termini have yet to be determined



ID-182



Clone 3-8b (ID-120b)

ATGTACCATATTGAATTAAAAAAGGAAGCTTTACTACCAAGAGAACGCCT 
AGTTGATTTAGGCGCAGATAGATTGAGTAATCAGGAGTTATTAGCCATTCT
CTTACGTACAGGTATTAAAGAAAAACCTGTTCTTGAAATTTCAACGCAAAT 
TTTAGAAAACATAAGCAGTTTAGCAGATTTTGGTCAATTATCCTTACAGGA 
GTTGCAATCCATTAAAGGAATCGGTCAGGTTAAATCCGTCGAAATAAAAG 
CTATGCTAGAACTAGCAAAACGGATTCACAAAGCTGAATATGATCGTAAA 
GAGCAAATTTTAAGTAGTGAACAATTAGCGAGGAAAATGATGCTCGAATT
AGGGGATAAAAAACAAGAACATTTAGTAGCTATTTATATGGATACACAAA 
ATCGTATTATCGAACAGAGAACTATTTTTATTGGTACTGTACGTCGTTCAG 
TAGCAGAGCCAAGAGAAATTCTACATTATGCTTGTAAAAACATGGCAACT 
TCTTTGATTATTATACATAATCATCCCTCAGGTTCTCCAAATCCCAGTGAAA 
GTGATTTAAGTTTCACTAAAAAAATAAAACGATCATGTGATCATCTGGGAA
TTGTCTGCCTAGATCACATCATCGTTGGAAAAAATAAATATTATAGTTTTC 
GAGAAGAAGCAGATATTTTATAA 

MYHIELKKEALLPRERLVDLGADRLSNQELLAILLRTGIKEKPVLEISTQILENI
SSLADFGQLSLQELQSIKGIGQVKSVEIKAMLELAKRIHKAEYDRKEQILSSEQ
LARKMMLELGDKKQEHLVAIYMDTQNRIIEQRTIFIGTVRRSVAEPREILHYAC
KNMATSLIIIHNHPSGSPNPSESDLSFTKKIKRSCDHLGIVCLDHIIVGKNKYYSF
REEADIL* 



Sequence description:

A] Length: 681 bp - 227 aa (full-length gene)

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-120 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-120 gene sequence.
ATG start codon is preceded by a typical
Shine-Dalgarno sequence. No obvious leader
peptide sequence



ID-183



Clone 3-11b (ID-121b)

TGGTTAAAAGTAGTGATAGCTTGTATTCCATCTATTTTAATTGCTTTACCAT 
TTGATAATTGGTTTGAAGCTCATTTTAATTTCATGATTCCGATTGCAATAGC
CCTAATCTTTTATGGTTTTGTCTTCATATGGGTTGAAAAACGTAATGCACAC 
CTCAAACCACAGGTAACCGAATTGGCAAGTATGTCTTACAAGACAGCTTTC 
TTGATTGGATGTTTCCAGGTTCTCAGTATTGTTCCGGGAACCAGTCGTTCTG 
GAGCTACTATTTTAGGAGCAATTATTATTGGAACTAGTCGTTCGGTCGCTG 
CTGACTTTACTTTCTTCCTTGCCATCCCAACTATGTTTGGTTATAGTGGACT
TAAGGCGGTTAAATATTTTTTAGATGGTAACGTCTTGAGTTTAGACCAATC 
TTTAATACTTTTAGTAGCAAGTCTGACAGCTTTCGTAGTTAGTTTATATGTT 
ATTCGTTTCTTGACAGACTATGTCAAACGACACGATTTCACCATCTTTGGT 
AAGTATCGTATAGTCTTAGGAAGTTTACTCATCCTCTACTGGTTAGTTGTTC 
ATTTATTCTAA

(SEQ ID NO: [illegible])

WLKVVIACIPSILIALPFDNWFEAHFNFMIPIAIALIFYGFVFIWVEKRNAHLKP
QVTELASMSYKTAFLIGCFQVLSIVPGTSRSGATILGAIIIGTSRSVAADFTFFLA
IPTMFGYSGLKAVKYFLDGNVLSLDQSLILLVASLTAFVVSLYVIRFLTDYVKR
HDFTIFGKYRIVLGSLLILYWLVVHLF*

Sequence description:

A] Length: 579 bp - 193 aa (partial sequence)

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-68 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-68 gene sequence
described in WO 00/06736. N-terminus has yet to be determined.



ID-184



Clone 3-11c (ID-121c)





ATGGAAATGAAACAAATCAGTGAAACAACACTGAAAATTACAATTAGTAT 
GGAAGATTTAGAAGATCGTGGTATGGAGCTGAAAGATTTCCTAATCCCTCA 
GGAGAAGACTGAGGAATTTTTCTATTCTGTCATGGATGAATTAGACTTGCC 
AGAAAACTTTAAAAATAGTGGTATGTTAAGTTTTCGAGTAACACCTAAAA 
AAGATCGCATTGATGTTTTTGTTACAAAGTCTGAATTAAGTAAAGATTTAA 
ATTTAGAAGAATTAGCAGATTTGGGTGACATTTCAAAAATGTCTCCAGAAG 
ACTTTTTTAAAACCTTGGAACAATCGATGTTGGAAAAAGGGGATACGGAT 
GCCCATGCCAAATTAGCAGAAATTGAAAATATGATGGATAAAGCAACTCA 
AGAAGTAGTTGAGGAAAATGTTTCTGAAGAACAACCTGAAAAGGAAGTAG 
AAACGATTGGATATGTTCACTATGTCTTTGATTTTGATAATATTGAAGCTGT 
AGTTCGATTTTCACAAACGATTGATTTTCCAATAGAAGCTT 

MEMKQISETTLKITISMEDLEDRGMELKDFLIPQEKTEEFFYSVMDELDLPENF 
KNSGMLSFRVTPKKDRIDVFVTKSELSKDLNLEELADLGDISKMSPEDFFKTLE 
QSMLEKGDTDAHAKLAEIENMMDKATQEVVEENVSEEQPEKEVETIGYVHY 
VFDFDNIEAVVRFSQTIDFPIEA 



Sequence description: 

A] Length: 547 bp - 182 aa (Partial sequence)

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-68 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-68 gene sequence.
ATG start codon is preceded by a typical
Shine-Dalgarno sequence. No obvious potential
leader peptide sequence



ID-185



Clone 3-16b (ID-122b)

GGAAACCAACGGCCAGTACAATCGTCAAGGGTAGATTATCCTAAACGTAG 

TCGTGCCAAGATTGTAGAAGTTTATTTTAGACAAGCTTCTACTACTGATTA 

TTCTGGTGTTTACAAAGGTTACTATATTGACTTTGAAGCCAAAGAAACCCG 

GCAGAAAACTGCTATGCCTATGAAAAATTTTCATGCTCACCAAATAGAGC 

ACATGGCAAATGTATTACAGCAAAAAGGGATTTGCTTTGTCTTGCTTCATT 




TTTCCACACTTAAGGAAACCTATCTACTCCCTGCTAATGAGTTAATTTCATT 
TTATCAGATTGATAAAGGCAATAAATCAATGCCTATTGATTATATCAGAAA 
AAATGGATTTTTCGTAAAGGAGAGTGCCTTTCCTCAAGTCCCTTACTTAGA 
TATTATTGAAGAAAAATTATTAGGCGGTGATTACAATTAA 

GNQRPVQSSRVDYPKRSRAKIVEVYFRQASTTDYSGVYKGYYIDFEAKETRQ 
KTAMPMKNFHAHQIEHMANVLQQKGICFVLLHFSTLKETYLLPANELISFYQI 
DKGNKSMPIDYIRKNGFFVKESAFPQVPYLDIIEEKLLGGDYN* 



Sequence description: 

A] Length: 447 bp - 149 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was
identified upstream of the ID-122 gene which was identified by LEEP, during
cloning and sequence analysis of the full-length ID-122 gene sequence.
N-terminus has yet to be determined



ID-186 



Clone 3-17b (ID-123b)
(SEQ ID NO: [illegible])



GGATCCTAAAAACGCTAAGGTTTATCAAAAAAATGCTGATCAATTTAGTG 
ACAAGGCAATGGCTATTGCAGAGAAGTATAAGCCAAAATTTAAAGCTGCA 
AAGTCTAAATACTTTGTGACTTCACATACAGCATTCTCATACTTAGCTAAG 
CGATACGGATTGACTCAGTTAGGTATTGCAGGTGTCTCAACCGAGCAAGA 
ACCTAGTGCTAAAAAATTAGCCGAAATTCAGGAGTTTGTGAAAACATATA 
AGGTTAAGACTATTTTTGTTGAAGAAGGAGTCTCACCTAAATTAGCTCAAG 
CAGTAGCTTCAGCTACTCGAGTTAAAATTGCAAGTTTAAGTCCTTTAGAAG 
CAGTTCCCAAAAACAATAAAGATTACTTAGAAAATTTGGAAACTAATCTTA 
AGGTACTTGTCAAATCGTTAAATCAATAG 

DPKNAKVYQKNADQFSDKAMAIAEKYKPKFKAAKSKYFVTSHTAFSYLAKR 
YGLTQLGIAGVSTEQEPSAKKLAEIQEFVKTYKVKTIFVEEGVSPKLAQAVAS
ATRVKIASLSPLEAVPKNNKDYLENLETNLKVLVKSLNQ*



Sequence description: 




A] Length: 433 bp - 144 aa (partial sequence)

B] This gene sequence was not identified using the LEEP system. It was
identified downstream of the ID-123 gene which was identified by LEEP,
during cloning and sequence analysis of the full-length ID-123 gene sequence.
N-terminus has yet to be determined



ID-187



Clone 3-46/47 (ID-130b)

1 5 XrGA AAAA* ACT* C ATCG ATTT AAA AA AACT AC A AA AAGC AT ACGCCTC AG A 
AACTGTTTTAAATAATATTAATTTGGAGGTGTTTAAAGGAGAAATAATTGG 
ATTAATAGGACCCTCTGGAGCAGGGAAATCTACCTTGATTAAAACTATGCT 
TGGCATGGAAAAAGCAGATAAGGGAACAGCTCTTGTTCTTGATACTCAAA 
TGCCAGATCGTAATATTTTAAATCAAATTGGCTATATGGCTCAATCTGATG 

CCTTACACGAGTCTTTAACTGGCTTAGAAAATTTATTATTCTTTGGAAAAA
TGAAAGGTATTCAAAAAACTGAATTAAAACAGCAGATAACTCATATTTCT 
AAAGTAGTAGATCTAGAAAACCAACTTGATAAATTTGTCTCAGGTTACTCA 
GAAGGTATGAAAAGACGGCTTTCTCTAGCCATCGCCCTACTTGGAAACCCC 
ACAGTTTTAATCCTAGATGAACCTACCGTTGGAATTGATCCATCCTTGAGG 

AGAAAAATCTGGCAAGAGCTAATTAATATTAAGGATGAAGGACGTTCTAT
CTTTATTACAACCCACGTTATGGATGAAGCAGAATTAACAAGTAAGGTTGC 
ACTACTATTACGTGGAAACATTATTGCCTTTGATACTCCATTACATTTAAA 
AAAACAATTTAATGTGAGTACTATTGAGGAAGTTTTCTTAAAAGCTGAAGG 
AGAATAA 

MKKVIDLKKLQKAYASETVLNNINLEVFKGEIIGLIGPSGAGKSTLIKTMLGME 
KADKGTALVLDTQMPDRNILNQIGYMAQSDALHESLTGLENLLFFGKMKGIQ 
KTELKQQITHISKVVDLENQLDKFVSGYSEGMKRRLSLAIALLGNPTVLILDEP 
TVGIDPSLRRKIWQELINIKDEGRSIFITTHVMDEAELTSKVALLLRGNIIAFDTP
LHLKKQFNVSTIEEVFLKAEGE*



Sequence description: 



A] Length: 717 bp - 239 aa (Possible full-length sequence)

B] This gene sequence was not identified using the LEEP system. It was
identified upstream of the ID-130 gene which was identified by LEEP, during
cloning and sequence analysis of the full-length ID-130 gene sequence. ATG
start codon is preceded by a possible
Shine-Dalgarno sequence. No obvious potential leader
peptide sequence



ID-188



Clone 3-83b (ID-144b)

(SEQ ID NO: [illegible])

ATGGTACAAATGATACATGATATGATTAAAACAATTGAGCATTTTGCTGAG 

ACACAAGCTGATTTTCCAGTGTATGATATTTTAGGGGAAGTCCATACTTAT 

GGACAACTTAAAGTAGACTCTGACTCTCTAGCTGCTCATATTGATAGCCTA 

GGCCTTGTTGAAAAATCACCTGTCTTAGTATTCGGTGGTCAAGAATATGAA 

ATGTTGGCGACATTTGTTGCTTTAACAAAGTCAGGGCATGCTTATATACCG
GTTGACCAACACTCTGCTTTGGATAGAATACAGGCTATTATGACAGTTGCT 
CAACCAAGCCTTATCATTTCAATTGGTGAATTTCCTCTTGAAGTTGATAAT 
GTCCCAATCCTAGACGTTTCTCAAGTTTCAGCTATTTTTGAAGAAAAGACT 
CCTTATGAGGTAACACATTCTGTTAAAGGTGATGATAATTACTATATTATT 

TTCACTTCAGGGACTACTGGTTTACCAAAAGGTGTGCAAATTTCACATGAC 
AATTTATTGAGCTTTACAAATTGGATGATTTCTGATGATGAGTTTTCAGTTC 
CTGAAAGACCGCAAATGTTGGCTCAACCC 

MVQMIHDMIKTIEHFAETQADFPVYDILGEVHTYGQLKVDSDSLAAHIDSLGL 
VEKSPVLVFGGQEYEMLATFVALTKSGHAYIPVDQHSALDRIQAIMTVAQPSL 
IISIGEFPLEVDNVPILDVSQVSAIFEEKTPYEVTHSVKGDDNYYIIFTSGTTGLP 
KGVQISHDNLLSFTNWMISDDEFSVPERPQMLAQP 
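
The pairing of a nucleotide listing with its amino acid listing can be spot-checked by translating the first line of the nucleotide sequence. The sketch below assumes the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and are not part of the original filing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the standard code applies to the organism in question).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))

# First nucleotide line of the ID-188 entry above.
first_line = "ATGGTACAAATGATACATGATATGATTAAAACAATTGAGCATTTTGCTGAG"
print(translate(first_line))
```

The printed residues can be compared with the start of the amino acid listing for the same entry.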



Sequence description: 



A] Length: 592 bp - 197 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-144 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-144 gene sequence. 
Putative ATG start codon is preceded by a typical Shine-Dalgarno sequence. 
No obvious leader peptide sequence. 
This orf is not in frame with nuc 



ID-189 



Clone 3-86b (ID-145b) 

li&^AA^i^CGTTATGAAGATGAAGGTAAATTCCAGCGTAAGATGAC 
CAGTCGTCATCTCTTTATGTTATCGCTAGGTGGTGTTATCGGGACTGGGCTT 
TTCTTGAGTTCAGGTTATACCATTGCACAGGCTGGTCCGCTTGGAGCTGTG 
CTGTCTTATTTGATTGGTGCCGTTGTGGTTTATTTGGTCATGCTATCACTTG 

GGGAATTGGCGGTTGCCATGCCGGTGACGGGGTCATTCCACACTTATGCCA 
CTAAGTTTATCAGTCCTGGAACAGGTTTTACTGTTGCTTGGCTATATTGGAT 
TTGTTGGACGGTCGCCTTGGGGACTGAATTTTTAGGTGCTGCCATGCTGAT 
GCAGCGCTGGTTCCCAAATGTGCCGGCTTGGGCATTTGCTTCCTTTTTTGCC 
CTTGTGATTTTTGGTTTAAATGCTCTTAGCGTACGCTTTTTTGCAGAAGCAG 

AGTCTTTCTTCTCAAGTATTAAGGTTATTGCTATCATTATCTTTATTATCTTG 
GGCTTAGGTGCTATGTTTGGTCTAGTTTCCTTTGAAGGTCAGCACAAGGCT 
ATTCTCTTCACTCATCTGACTGCCAATGGTGCCTTTCCAAATGGTATCGTTG 
CAGTTGTCTCAGTCATGTTGGCTGTTAACTATGCCTTCTCTGGTACTGAGTT 
AATTGGTATTGCGGCTGGTGAAACGGATAATCCCAAAGAAGCTGTACCAA 

GGGCTATTAAAACGACAATCGGTCGCTTGGTTGTTTTCTTTGTACTGACAA 
TTGTTGTCCTAGCTTCGCTATTGCCAATGAAAGAGGCAGGCGTATCCACAG 
CACCATTCGTTGATGTCTTTGACAAGATGGGAATCCCTTTTACGGCGGATA 
TCATGAACTTCGTTATCTTGACAGCCATCCTGTCTGCTGGTAACTCAGGTCT 
CTACGCATCAAGCCGTATGCTCTGGTCCCTTGCCAATGAAGGTATGTTGTC 

AAAATCTGTTGTGAAAATCAATAAACACGGTGTCCCAATGCGTGCTCTTCT 
CTTGTCAATGGCAGGAGCAGTGCTGTCGCTCTTTTCAAGTATTTACGCTGC 
AGACACAGTTTATCTAGCCTTGGTTTCAATCGCGGGCTTTGCTGTTGTTGTC 
GTATGGCTAGCCATTCCAGTCGCACAAATCAATTTCCGCAAGGAATTC 

MENHRYEDEGKFQRKMTSRHLFMLSLGGVIGTGLFLSSGYTIAQAGPLGAVL 
SYLIGAVVVYLVMLSLGELAVAMPVTGSFHTYATKFISPGTGFTVAWLYWIC 
WTVALGTEFLGAAMLMQRWFPNVPAWAFASFFALVIFGLNALSVRFFAEAES 
FFSSIKVIAIIIFIILGLGAMFGLVSFEGQHKAILFTHLTANGAFPNGIVAVVSVM 
LAVNYAFSGTELIGIAAGETDNPKEAVPRAIKTTIGRLVVFFVLTIVVLASLLPM 

KEAGVSTAPFVDVFDKMGIPFTADIMNFVILTAILSAGNSGLYASSRMLWSLA 
NEGMLSKSVVKINKHGVPMRALLLSMAGAVLSLFSSIYAADTVYLALVSIAGF 
AVVVVWLAIPVAQINFRKEF 




Sequence description: 



A] Length: 1126 bp - 393 aa (partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-145 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-145 gene sequence. 
Putative ATG start codon is preceded by a typical Shine-Dalgarno sequence. 
Possesses a possible leader peptide sequence. 



ID-190 



Clone 3-94b 

(SEQ ID NO: [illegible]) 

TCAGAAAATGCAGAGGCAGCAACGGTTGCCACAAACTTGGTTACCAAAGG 

AGCTAATGTCATTATCGGACCAGCAACATCGGGTGCAGCTGCATCTTCAAC 

TCCAAAAGTAAATGCAGCAGCAGTTCCAATGATTGCACCTGCTGCGACAC 

AAGACAATTTAGTCTATGGTTCTGATGGAAAAACCTTAAATCAGTATTTCT 

TCCGAGCTACTTTTGTCGATAATTATCAAGGAAAGCTATTGTCTCAGTATG 

CTACAGACAACCTTAAAGCTAAAAAAGTTGTTCTATTTTATGATAATTCAT 

CAGATTACTCAAAGGGGGTAGCAAAATCATTTAAGGAAAGTTATAGTGGA 

AAAATTGTTGATAGTATGACATTCTCCGCTGGTGATACTGATTTCCAAGCG 

TCATTGACTAAGTTGAAAGGGAAAGAATATGATGCTATTGTGATGCCAGG 

TTACTATACCGAGACAGGATTAATAGTTAAGCAAGCGCGTGATTTAGGTAT 

CTCTAAACCGGTTCTTGGGCCTGATGGTTTTGATAGTCCGAAATTTGTGCA 

ATCGGCAACACCTGTAGGAGCTTCAAACGTTTATTATTTGACAGGTTTCAC 

TACACAAGGATCAACCAAAGCTAAAGCT 

(SEQ ID NO: [illegible]) 

SENAEAATVATNLVTKGANVIIGPATSGAAASSTPKVNAAAVPMIAPAATQD 

NLVYGSDGKTLNQYFFRATFVDNYQGKLLSQYATDNLKAKKVVLFYDNSSD 

YSKGVAKSFKESYSGKIVDSMTFSAGDTDFQASLTKLKGKEYDAIVMPGYYT 

ETGLIVKQARDLGISKPVLGPDGFDSPKFVQSATPVGASNVYYLTGFTTQGST 

KAKA 



Sequence description 



A] Length: 637 bp - 231 aa (partial sequence) 



B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-149 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-149 gene sequence. N- and 
C-termini have yet to be determined 



ID-191 



Clone 2-c94b (ID-153b) 



i 5i 
TT< 



TGGGACTTAAAGACCATGCTTTAGTCTATCCATTTTCATTATCTGGGGGG 
CAAAAGCAACGTGTCGCACTAGCTCGTGCGATGATGATTGATCCACAGATT 
ATTGGTTATGATGAGCCAACTAGCGCTCTTGATCCAGAGTTGCGTCAAGAA 
GTAGAAAAACTAATTTTACAAAATAGAGAAACAGGTATGACACAAATTGT 
AGTAACACATGATCTTCAATTTGCTGAAAGTATATCTGATACGATTCTCAA 
AATTAATCCTAAGTAG 

(SEQ ID NO: [illegible]) 

MC^KDHALVYPFSLSGGQKQRVALARAMMIDPQIIGYDEPTSALDPELRQEV 
EKLILQNRETGMTQIVVTHDLQFAESISDTILKINPK* 






Sequence description 



A] Length: 270 bp - 90 aa (partial sequence) 
B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-153 gene which was identified by LEEP, during 
cloning and sequence analysis of the ID-153 gene sequence. 
N-terminus has yet to be determined 






ID-192 



Clone 2-c1b (ID-155b) 

ATGACTAATATCTCAGATGTTCCAAAAGCTATTAGAACACAGGCACAGTAT 
GTTCTCTTGGGAATGAGAGTTATGGATCAGTCGGTATTACCGAAAACATAT 
AATTCAAAAGAACCTTATTTGAAACCAGATATGATTTATATTCATGATAGA 



AGACAAGAGACAATGCTTAAAATCACTCAAGAAATAGAAATGGAGCATTG 
A 

MTNISDVPKAIRTQAQYVLLGMRVMDQSVLPKTYNSKEPYLKPDMIYIHDRR 
QETMLKITQEIEMEH* 



Sequence description 


A] Length: 204 bp - 68 aa (partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-155 gene which was identified by LEEP, during 
cloning and sequence analysis of the ID-155 gene sequence. 
ATG start codon is preceded by a potential typical Shine-Dalgarno sequence. 
Has a typical leader peptide. N-terminus has yet to be verified. 



ID-193 

Clone 2-54altb (ID- 1 72b) 

AAGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCTTGGGGAATATAAATT 
TGGATTTCATGACGATGTAAAGCCAATTTATTCTACGGGAAAAGGTCTAAA 
TGAGGCTGTTATTCGTGAGTTATCTGCAGCTAAGGGTGAACCTGAGTGGAT 

GTTGGACTTTCGTCTAAAATCCTTGGAAACGTTTAATAAAATGCCGATGCA 
GACCTGGGGAGCAGATTTATCAGATATTGATTTTGATGATATTATTTATTA 
TCAAAAAGCATCTGATAAACCTGCGCGTGATTGGGATGATGTTCCAGAAA 
AAATCAAAGAAACTTTTGAAAGAATTGGGATTCCAGAAGCTGAAAGAGCC 
TATCTTGCAGGAGCATCAGCACAATATGAATCAGAAGTAGTTTATCACAAT 

ATGAAAGAAGAATATGATAAGCTGGGTATTGTTTTTACGGATACTGACTCT 
GCACTTAAAGAGTACCCAGAGCTATTCAAAAAATATTTTGCTAAACTTGTC 
CCTCCAACAGATAATAAATTAGCTGCTCTGAACTCTGCTGTATGGTCAGGT 
GGAACATTTATTTATGTTCCTAAAGGTGTTAAGGTGGATATTCCACTTCAA 
AGTTACTTGCGTATTAATAATGAAAATACTGGACAATTTGAACGTACTCTC 

ATTATTGTTGATGAGGGAGCAAGTGTTCACTATGTTGAAGGTTGTACCGCC 
CCAACTTATTCTTCAAATAGTTTACATGCAGCTATAGTTGAAATTTTTGCAC 
TTGATGGAGCTTATATGCGCTATACGACTATTCAAAATTGGTCCGATAATG 
TCTATAATTTAGTGACAAAACGTGCTACCGCTAAAAAAGATGCAACAGTT 
GAGTGGATAGATGGAAATCTAGGAGCTAAAACAACAATGAAATACCCATC 



GGTTTACCTTGATGGTGAAGGAGCACGTGGCACGATGTTGTCTATTGCTTT 

TGCAAACAAAGGACAACACCAAGATACGGGTGCAAAGATGATTCATAATG 

CCCCCCATACTAGTTCATCCATTGTCTCTAAATCAATTGCTAAGGGTGGGG 

GAAAAGTTGATTATCGAGGTCAAGTGACATTTAATAAAGATTCCAAAAAA 

TCAGTGTCACATATAGAATGTGACACCATATTGATGGATGATATTTCAAAA 

TCAGATACCATACCGTTTAATGAGATTCATAATTCACAGGTTGCTTTAGAG 

CATGAAGCAAAGGTGTCTAAGATTTCTGAAGAGCAACTGTACTACTTGATG 

AGTCGAGGTTTATCTGAAGCTGAAGCAACAGAAATGATTGTTATGGGGTTT 

GTTGAGCCCTTTACGAAAGAATTACCAATGGAATATGCGGTAGAGTTAAA 

TCGTTTAATTTCCTATGAAATGGAAGGTTCAGTTGGTTAA 

l^AC^S^EbtG^VKFGFHDDVKPIYSTGKGLNEAVIRELSAAKGEPEWMLD 

FRLKSLETFNKMPMQTWGADLSDIDFDDIIYYQKASDKPARDWDDVPEKIKE 

TFERIGIPEAERAYLAGASAQYESEVVYHNMKEEYDKLGIVFTDTDSALKEYP 

ELFKKYFAKLVPPTDNKLAALNSAVWSGGTFIYVPKGVKVDIPLQTYFRINNE 

NTGQFERTLIIVDEGASVHYVEGCTAPTYSSNSLHAAIVEIFALDGAYMRYTTI 

QNWSDNVYNLVTKRATAKKDATVEWIDGNLGAKTTMKYPSVYLDGEGARG 

TMLSIAFANKGQHQDTGAKMIHNAPHTSSSIVSKSIAKGGGKVDYRGQVTFN 

KDSKKSVSHIECDTILMDDISKSDTIPFNEIHNSQVALEHEAKVSKISEEQLYYL 

MSRGLSEAEATEMIVMGFVEPFTKELPMEYAVELNRLISYEMEGSVG* 



Sequence description: 

A] Length: 1411 bp - 469 aa (Possible full-length gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-72 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-72 gene sequence. 
No obvious Shine-Dalgarno sequence upstream of TTG start codon (insufficient 
sequence data). N-terminus needs verification. 



ID-194 

Clone 3-1b (ID-81b) 

ATGATAGAATTCTTTTCTAATATCAGAACAGAGATTCCGCAGATGCCTTTA 
CTTATCCATAGTTTGATTTTATCTGTCTTACCTTTTCTGATGTGGCTGACTTT 
GGTTAATAGAGATAAGCCTTTGTATAAAACTATTTGGAGTATCCTTTTAGG 
ACTTCAGTTAATTACGATTTATACTTGGTTTTTCTGGGCAAAATTGCCTTTA 



TCTGAAAGTCTTCCCCTTTACCATTGTCGAATAGGCATGTTTGTCGGTCTCT 
TA 

MIEFFSNIRTEIPQMPLLIHSLILSVLPFLMWLTLVNRDKPLYKTIWSILLGLQLI 
TIYTWFFWAKLPLSESLPLYHCRIGMFVGLL 



Sequence description 

A) Length: 261 bp - 87 aa (partial gene sequence) 
B) This gene sequence was not identified using the LEEP system. It was identified 
downstream of the ID-81 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-81 gene sequence. Sequence Characteristics: 
Possesses a potential leader peptide sequence. ORF is preceded by a potential 
Shine-Dalgarno sequence. 



ID-195 

Clone RS-55b 

AAGCTTGTGCAAAGTATTAAAGAGATAGGATTAGCTAATGCGCATTTATTA 
GCTGTTGCTCCGACAGGGTCAATCAGTTATCTTTCTTCTTGTACTCCGAGCC 
TTCAACCGGTTGTATCACCTGTCGAAGTACGCAAGGAAGGAGCACTGGGG 
AGGGTTTATGTAGCTGCTTATAAGATTGATGCAGATAATTATGTCTACTAC 
AAAAAAGGAGCTTATGAAGTGGGATCTGAGGCGATTATCAATATTGCAGC 
TGCCGCTCAAAAACACATTGATCAAGCTATTTCGTTAACGCTTTTCATGAC 
AGATCAAGCAACTACGCGAGATTTAAATAAAGCCTATATTCAAGCATTTA 
AACAAAAATGTGCCTCTATTTATTATGTACGAGTGAGACAGGACATCCTAG 
AAGGTAGCGAGAGTTATGATGATATGCTGGATGATTTCACTTCATCGGACT 
TAGAAGACTGTCAATCCTGCATGATTTAA 

KLVQSIKEIGLANAHLLAVAPTGSISYLSSCTPSLQPVVSPVEVRKEGALGRV 
YVAAYKIDADNYVYYKKGAYEVGSEAIINIAAAAQKHIDQAISLTLFMTDQAT 
TRDLNKAYIQAFKQKCASIYYVRVRQDILEGSESYDDMLDDFTSSDLEDCQSC 
MI* 




Sequence description: 



A] Length 486 bp - 162 aa (Partial sequence) 

B] This gene sequence was not identified using the LEEP system. It was identified 
upstream of the ID-87 gene which was identified by LEEP, during cloning and 
sequence analysis of the full-length ID-87 gene sequence. N-terminus to be 
determined. 



ID-196 

Clone RS-59(ID-90b) 

GTGAGGACATATATTACAAACTTGAATGGACATTCAATCACTAGTACAGC 

ACAAATAGCTCAAAACATGGTAACAGATATAGCAGTAAGCTTAGGTTTTC 

GTGAGCTGGGAATACATTCTTATCCGATTGATACTGATTCTCCTGAGGAAA 

TGAGTAAGCGTTTAGATGGAATCTGTTCCGGACTTAGAAAAAATGATATTG 

TCATATTTCAGACACCTACATGGAACACTACAACTTTTGATGAAAAATTAT 

TTCACAAATTAAAAATATTTGGTGTAAAGATTGTTATTTTTATACATGATGT 

TGTACCGCTAATGTTTGATGGAAATTTTTATTTGATGGATAGAACTATAGC 

TTATTATAATGAAGCAGATGTTTAATAGCCCCTAGTCAAGCAATGGTCGAT 

AAGCTT 

MRTYITNLNGHSITSTAQIAQNMVTDIAVSLGFRELGIHSYPIDTDSPEEMSKRL 
DGICSGLRKNDIVIFQTPTWNTTTFDEKLFHKLKIFGVKIVIFIHDVVPLMFDGN 
FYLMDRTIAYYNEADVLIAPSQAMVDKL 

Sequence description: 

A] Length: 414 bp - 138 aa(partial gene) 

B] This gene sequence was not identified using the LEEP system. It was 
identified downstream of the ID-90 gene which was identified by LEEP, 
during cloning and sequence analysis of the full-length ID-90 gene sequence. 
No obvious signal peptide, but a possible Shine-Dalgarno sequence is present 
upstream of ATG start codon. C-terminus has yet to be determined. 



ID-197 



Clone RS-59c (ID-90c) 



CATGGAAATGAAGTTGATGATGTTATTAGAAGGGCATTTGAATATAATCAC 
CTTATCTTTGCTTTTGATAATACCTGTCATAACAGAGAGTTAGTATTAGATA 
GCAATATCATTTCTCACACAACCTGTGAACAATTGATAAATTTAATGAAAA 
ATTTATCAGGCTCCATTATGTATTTGCTAGAGCAACAAAGAGAACAAACA 
AGTAATGAAACAAAAGAGCGTTATAAAGAAATATTAGGAGGGTATGGAA 
ATGCCTAA 

HGNEVDDVIRRAFEYNHLIFAFDNTCHNRELVLDSNIISHTTCEQLINLMKNLS 
GSIMYLLEQQREQTSNETKERYKEILGGYGNA* 



Sequence description: 

A] Length: 261 bp - 87 aa(partial gene sequence) 

B] This gene sequence was not identified using the LEEP system. It was 
identified upstream of the ID-90 gene which was identified by LEEP, during 
cloning and sequence analysis of the full-length ID-90 gene sequence. 
N-terminus has yet to be determined 



ID-198 
Clone RS-70b (ID-93b) 

icAVf^T^^A^T^TGTATTTGAAGACGTAGCCACCCAGTCAAATATGACT 
GGGAAGATTTTTAGTATGTCTAAAGAAGAGTTGTCATATTTACCCGTTATT 
AAACTTTTTAAGAATCAAGGTGTATACAACGGCTTGATTGGTCTATTCCTC 

TCAATGTGTTGCTAGTTGCTGTTTATGGTGCTTTGACAGTTGATAAAAAAA 
TCTTATTAAAACAGGGTGGTTTACCTATATTAGCTCTTTTAACATTCTTATT 
TTAA 

TFLYYVFEDVATQSNMTGKIFSMSKEELSYLPVIKLFKNQGVYNGLIGLFLLY 
GLYISQNQEIVAIFLINVLLVAVYGALTVDKKILLKQGGLPILALLTFLF* 
Sequence description: 



A] Length: 312 bp - 104 aa (partial gene sequence) 



B] This gene sequence was not identified using the LEEP system. It 
was identified upstream of the ID-93 gene which was identified by 
LEEP, during cloning and sequence analysis of the full-length ID-93 
gene sequence. 

N-terminus has yet to be determined 



AGTGATGCAATACAAGAAACGATTCTTTTATCACAAGAGGCGGAGCAACT 

AGGCTATCATCAATTTTGGGTGGCTGAACATCACGGTGTTAAGGCATTCAG 

TATTAGCAATCCAGAATTAATGATAATGCATTTGGCTAACCAGACTAAATC 

TATCAAAATTGGCTCTGGAGGTATAATGCCTCTGCACTATAGTAGTTTTAA 

ACTCGCGGAGACTCTCAAGACATTAGAGACATGTCATCCGAATCGAGTAA 

GTATTGGTTTAGGAAATTCACTAGGGACAGTTAAAGTTTCAAATGCACTTC 

GTAGCTTACATAAAGCACATGATTACGAAGAGGTACTGGAGGAATTGAAG 

TCATGGCTTATTGATGAATCATCCAGTAAGGAACCATTAGTTCAACCGACT 

CTTTCTAGCTTCCCAGACTTATATGTGTTGGGGAGTGGTCAAAAATCAGCT 

TATTTAGCGGCTAAACTTGGCTTAGGCTTTACCTTGGGTGTTTTTCCTTTTA 

TGGACAAAGACCCATTGACAGAAGCTAAA 

MKLSVLDYGLIDYGKTASDAIQETILLSQEAEQLGYHQFWVAEHHGVKAFSIS 
NPELMIMHLANQTKSIKIGSGGIMPLHYSSFKLAETLKTLETCHPNRVSIGLGN 
SLGTVKVSNALRSLHKAHDYEEVLEELKSWLIDESSSKEPLVQPTLSSFPDLYV 
LGSGQKSAYLAAKLGLGFTFGVFPFMDKDPLTEAK 



Sequence description: 



A] Length: 588 bp - 196 aa (partial) 

B] This gene sequence was not identified using the LEEP system. It 
was identified downstream of the ID-93 gene which was identified by 
LEEP, during cloning and sequence analysis of the full-length ID-93 
gene sequence. No obvious signal peptide, but Shine Dalgarno 
sequence upstream of the ATG start codon. 



ID-199 



Clone RS-70c (ID-93c) 




CCTTGATTATGGGCTTATTGATTATGGAAAAACTGCA 
Figure 3 
Figure 4 



pTREP-Nuc vectors allow cloning of genomic DNA into each frame 
with respect to the nuclease gene 
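
Which frame an insert is read in depends on its offset, modulo three, from the start of translation. The sketch below only illustrates how the three forward frames partition the same nucleotides into different codons. It is not a model of the pTREP-Nuc vectors themselves. The helper name `reading_frames` is hypothetical, and the example fragment is the first twelve nucleotides of the ID-188 listing.

```python
def reading_frames(dna):
    """Split a sequence into complete codons for each of the three forward frames."""
    frames = {}
    for offset in range(3):
        # Stop early enough that only complete three-letter codons are kept.
        frames[offset] = [dna[i:i + 3] for i in range(offset, len(dna) - 2, 3)]
    return frames

fragment = "ATGGTACAAATG"
for offset, codons in reading_frames(fragment).items():
    print(offset, " ".join(codons))
```

Only one of the three offsets reproduces the codons of the listed open reading frame; the other two give unrelated codon series.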